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PREFACE 


This course of analysis is intended for mathematical specialists in 
their second and third years at Universities. We assume familiarity 
with the concept of a limit and its applications to infinite series and 
to the differential and integral calculus. The contents of A First 
Course in Mathematical Analysis by one of us (J.C.B.), which do not 
include Cauchy sequences, upper and lower limits or uniform con- 
vergence, form a suitable foundation for this book. From time to 
time we shall need to refer to a basic work on analysis and for 
simplicity we shall refer to the First Course, shortening its title 
to-Cl. 

An undergraduate after his first year should be ready for a more 
abstract setting and prepared to think in metric spaces instead of the 
Euclidean line or plane. The study of metric spaces provides not only 
a means of unifying different topics in analysis but also a natural 
link with topology. 

Chapters 1-9 concentrate on general analysis and real functions 
and 10-14 on complex functions. These last five chapters are largely 
independent of 6-9, and the reader who wishes to reach Cauchy’s 
theorem quickly needs little more than §8.1 and §8.2. 

After careful thought we decided to treat the Riemann and 
Riemann-Stieltjes integrals fully (chapters 6 and 8) and to leave the 
Lebesgue integral out. For many purposes the Riemann integral is 
sufficient, and the inclusion of an adequate account of Lebesgue 
measure and integration would impair the balance of the book. 

We are very grateful to Dr L. Mirsky and to Professor G. E. H. 
Reuter, each of whom has read the entire manuscript. The final form 
of it owes a great deal to their care and vigilance and to the experience 
which they gained from their own teaching. 

J.C.B. 
August 1968 H.B. 


1 
SETS AND FUNCTIONS 


1.1. Sets and numbers 


It has been recognized since the latter part of the nineteenth 
century that the idea of number (real and complex), and therefore all 
analysis, is based on the theory of sets. In modern analysis the 
dependence is explicit, for the language and algebra of sets are in 
constant use. 

The reader is likely to be familiar with the intuitive notion of a set 
and with the basic operations on sets. In this section we therefore 
confine ourselves to fixing the terminology and recapitulating the 
results that will be used subsequently. The notes at the end of the 
chapter refer to books which develop set theory systematically from 
explicitly stated axioms. 

There are synonyms for the word set such as collection and space. 
The members of a set are also called its elements or points. The 
statement that a belongs to (or is a member of) the set A is written 


aca, 
If a does not belong to A we write 
a¢ A. 


We denote by @ the empty set, namely the set which has no 
members. 


Inclusion of sets. Suppose that every member of the set A also 
belongs to the set B, i.e. that 


xEA>xXEB. (1.11) 
Then we say that A is contained in B and write 
aos of. A> A: 


the set A is said to be a subset of B. Note that any set is a subset of 
itself. Also the empty set is a subset of every set; for (1.11) is logically 
equivalent to ee Be 


and, when A is the empty set, this implication clearly holds for any 
set Z. 
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If A < Band BC A, Le. 
xEAsxeB, 
then A and B have the same members and we write 
A = B. 
If A < Band A + B, we can call A a proper subset of B. 


Union and intersection. Given any two sets A, B, the set which 
consists of all the elements belonging to A or to B (or to both) is 
called the union of A and B and is denoted by 


AUB. 


The set consisting of the elements which belong to both A and B is 
called the intersection of A and B and is denoted by 


ANB. 


The sets A, B are said to be disjoint if they have no elements in com- 
mon, ie. if ANB= @. 


Theorem 1.11. The following identities hold for any sets A, B, C. 
(i) AUB= BUA, ANB= BNA; 
(ii) AU(BUC) = (AUB)UG, An(BnC) = (An BNC; 
Gi) A U(BNC) = (AUB)N(AUQ), 
ANn(BUC) = (An B)U(ANC). 


Proof. We illustrate the argument used for establishing these 
identities by proving the first result in (111). 

Let x be any element of AU(BnN C). Then xe A or xE BNC. If 
xeA, then xe AUB and xE AUC, so that xe(AUB)N(AUC). 
If xe Bn C, then xe Band xe Cso that again x € (AU B)n (AUC). 
We have therefore shown that 


AU(BNC) < (AUB)N(AUO). (1.12) 


Now let yE(AUB)N(AUC). Then ye AUB and yEeAUC. It 
follows that y € A or that ye Band ye C, i.e. that ye A orye BNC. 
Hence y € AU(BN C) and so 

(AUB)N(AUC) ¢ AuU(BNC). (1.13) 


The inclusion relations (1.12). and (1.13) now yield the required 
identity. | 
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The laws of operation with U and fn on sets have some likeness 
to those with + and x on numbers. In fact U and f have replaced 
the signs for sum and product formerly used in the algebra of sets. 
The likeness is only partial: the identity A UA = A has no analogue 
in numbers; the second law in theorem 1.11 (iii) has an analogue, but 
not the first. . 

The definitions of union and intersection previously given may be 
extended. If @ is an arbitrary collection of sets, the union of these 
sets, denoted by U A 


Ac? 
is the set consisting of all those elements which belong to at least one 
of the sets A. The intersection of the sets of @, denoted by 
-) A, 


AEE 


is the set consisting of the elements which belong to all the sets A. 
We require one more operation with sets. If A, B are any two sets, 
the difference AB 


is the set consisting of those elements which belong to A but not to B. 
Note that the definition does not require B to be a subset of A. How- 
ever a particularly important case occurs when all sets under con- 
sideration are subsets of a given set X. Then the set X—A is called 
the complement of A (relative to X) and we shall denote it by A’. 
Clearly (A’)’ = A. 


Theorem 1.12. For any collection € of subsets of a set X, 
Oa) = fF): A and’ 1) A) = A. 
Ac@ Ace@ Ace@ AeE 


The proof is left to the reader. 

Theorem 1.12 shows that the operations LU and ( are comple- 
mentary and that an algebraic identity involving U and f) will admit 
of a dual identity obtained by interchanging U, (). For example, in 
each pair of identities in theorem 1.11 the second follows from the 
first. 


We shall take for granted that the reader is familiar with the systems 
of real and complex numbers. There are well known and simple 
methods for obtaining the complex numbers from the real numbers. 
For our purposes it is only necessary to postulate the existence of the 
system of real numbers as one satisfying certain axioms which are 
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given in C1 (11-12) and are restated in the notes at the end of the 
chapter. However, it is possible to construct the real numbers from 
much more primitive objects. Again the notes contain some remarks 
on the subject. 

The set of real numbers will be denoted by R1, the set of complex 
numbers by Z. Our notation for the open interval in R' which consists 
of the points x such that a < x < bis (a, b). The closed interval of 
points x such that a < x < bis written [a, b]. The intervals defined 
by a < x < banda < x < bare written [a, b) and (a, b]. The infinite 
interval [a, 00) consists of the points x such that x > a. Correspond- 
ing interpretations are put on the expressions (a, 0), (—%, a), 
(— oo, a]. Finally, (— ©, 00) is R?. 


Exercises 1 (a) 
1. Prove that 


Gi) AUB=AS+A@>B; Gi) ANB=AsSACB. 

2. Prove that 
(BUC)N(CU A)N(AU B) = (BN OC)U(CNA)U(AN B). 

3. Show that A—_B = An B’. Hence or otherwise prove that 

(i) (A—B)nN(A—-C) = A-(BUC) = (A-B)-C; 

(ii) (A—B)U (A—-C) = A-(BN QC); 

(iii) (A—C)n (B-C) = (An B)-C; 

(iv) (A—C)U(B—C) = (AU B)-C. 
4. Prove that 

(i) (A—B)NC = (ANC)-(BNO); 

(ii) (A—B)UC = (AUC)-(BUQC)S+Ce= @. 
5. Prove that 

Gi) (AUB)-B=A+ANB= Q; 

(ii) (A-—B)UB=AsA>B. 


6. The symmetric difference A A Bofthesets A, Bis defined as the set of elements 
in A or in B, but not in both. Show that 


A A B= (A—B)U(B—-A) = (AU B)-(AN B) = (AUB) N(A'U BY). 
Deduce that 
(i) AAB= @sA=B; Gi).A AB =AL B. 
7. Prove that 
Gi) AA B= BAA; Gi) AA BAC=AA(BAC). 
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8. Prove that 

(i) (AA BNC = (AnNC)A (BNO); 

(ii) (AA B)UC= (AUC) A (BUC) S+Ce= @; 

(iii) (AA B)U(AA COC) = (AUBUOC)N(VUB UC). 
9. Show that AA B=CAD+AAC=BAD. 


10. When E is a finite set (i.e. one with finitely many elements), denote by |E| the 
number of its elements. 
Show that, if the sets A, B are finite, then 


|A|+|B| = |Au Bi +|An Bl. 


1.2. Ordered pairs and Cartesian products 
A set with elements a, b, c, ... is often denoted by the symbol 


fa, by C, «.-}. (1.21) 


The notation calls for a number of comments. 

(i) In sucha listing of elements it is immaterial whether a particular 
element appears once or several times. This convention is purely a 
matter of convenience. For instance it allows us to denote the set of 
roots of a complex quadratic equation by {a, b}, whether a + 5 or 
L= 

(ii) The order in which a, b, c, ... are written in {a, b,c, ...} has no 
significance. For example {a, b} = {b, a}. 

(iii) It is important to distinguish between the object a and the 
set {a}, i.e. the set whose only member is a. Thus the set {2} has one 
element, while @ has none. 

The notation (1.21) has a useful variant. Given a set X, denote by 
P(x) a statement, which is either true or false, about the element 
x of X. The set of all x ¢ X for which P(x) is true is written 


{x € X|P(x)}, 
or simply {x|P(x)} 
if the identity of the set X is clear from the context. 


Illustrations 
(i) {xeZ[x2+1 = 0} = fi, -2}; 
(ii) {xe Rx?+1 =O = @; 
(iii) {x € R'|a < x < 5} = [a, D). 
The notion of a set does not involve any ordering among the 
elements. For instance in the set {a, 5}, a and b have the same status. 


6 SETS AND FUNCTIONS [1.2 
The ordered pair (a, b) is the set {a, b} together with the ordering ‘first 
a, then 6’. Thus (a, 5) and (0, a) are different unless a = b. (The 
context will determine whether the expression (a, b) stands for an 
ordered pair or an open interval.) 

The intuitive concept of an ordered pair (a, b) may be formalized 
by the definition fs, By <n a 
(See exercise 1(d), 1.) 

An ordered set (a,, ..., a,.) of any finite number n (> 1) of elements 
is defined in a similar way. 


Definition. Given the non-empty sets X. to «s+3 Xp, their Cartesian 
eat He 5 OR: (1.22) 
is the collection of all ordered sets (x1, ..., X,,) such that 

My G Ay, os Age Ae 


ff X, =... = X,. = X, the set (1.22) is denoted by X"; X* is taken to 
be X. 

For n > 1, R® = R'x...x R'! (n factors) is the set of all ordered 
sets (X,, ..., X,) of real numbers. We define an interval in R” as a set 


i fe ee = 


where J,, ..., J,, are intervals in R'. For example [a, b] x [c, d] is the 
set of points (x, y) € R* such thata < x < bandc<y<d. 


1.3. Functions 

Let X, Y be two non-empty sets. A relation from X to Y is a subset 
of Xx Y. Ifa relation f, i.e. a subset of X x Y, is such that, for every 
x € X, there is one and only one member (x, y) of f, then fis said to be 
a function on X to (or into) Y; we express this symbolically by writing 


ff: X—> Y. 


The set X is called the domain of f. 

In some contexts the terms mapping, transformation, or operator 
are often used as synonyms for function. Let (x, y) be an element of 
the function f: X¥ — Y. Then we say that y is the value of f at x (or 
the image of x under f) and we write y = f(x). If Eis any subset of X, 


the subset of Y {ye Yly =f(x) and xeB} 


is called the image of EF under /; it is denoted by f(£). The set f(X) is 
called the range of f. Note that f(X) may be a proper subset of Y. 
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If a function f has its domain in X and its range in Y, we say that 
fis a function from X to Y. A function from R? to R? will be called a 
real function, a function from Z to Z a complex function. 


Illustrations 


(i) We obtain a function on X to Y whenever we have a rule for assigning a 
single value y in Y to each x of X. Thus the equation x*+y® = 1 defines a 
function on R! to R! whose range is (— 0, 1]. 

(ii) The equation x?+ y? = 1 defines a relation but not a function on [—1, 1] 
to [—1, 1] because, whenever — 1 < x < 1, it is satisfied by two values of y. 

(iii) The logarithmic function is the function on (0, 00) to R' which assigns 
the value log x to the positive number x. 

(iv) The sequence aj, dz, a3, ... is a function whose domain is the set of positive 
integers exhibited as suffixes. 

More generally, given two arbitrary non-empty sets A, Jand a function on / to 
A, we sometimes find it convenient to write the image of a member / of J in the 
form q;. 

(v) Denote by © the set of all real functions ¢ defined by an equation of the 


form 
d(x) = axt+fPh (xe R}), 


where «, f are real numbers. A function f: R? > ® is obtained by assigning to 
each point (a, b) € R? the function ¢,,, given by $,,(x) = ax+b (xe R’). 


Let f be a function on X into Y. If f(X) = Y, te. if every ye Y is 
the image under fof at least one x € X, then we say that fis surjective, 
or that fmaps X onto Y. 

If different elements of X have different images, i.e. if f(x) = f(x’) 
implies x = x’, then fis called injective. 

If f is both surjective and injective, so that every value yin Y is 
assumed once and once only, then fis said to be bijective. A bijective 
function is also called a bijection; it effects a one-to-one correspondence 
between its domain and range. 

Consider again the illustrations above. 

(i) The function maps R! onto (—©, 1]; it is not injective. 

(iii) The logarithmic function is a bijection on (0, oo) to R’. 

(v) This function is a bijection on R? to ®. 


Inverse functions. Suppose that the function f has domain X and 
range Y. To f, which is a subset of Xx Y, there corresponds the sub- 


set of Yx X defined as 
{(y, x)|(x, ¥) €F}. (1.31) 


Plainly this is a function with domain Y and range X if and only if f 
is bijective. When f is bijective, (1.31) is called the inverse function 
of f and is denoted by 1. For instance the logarithmic function has 
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an inverse with domain (—©0, 00) and range (0, 00); the inverse is 
the exponential function. 

If the function f: X > Y is injective, then f: X + f(X) is bijective 
and f-1: f(X) > X is a function from Y to X. For example the 
function f: R! > R! defined by 

| 1 
IO) = Tre 


has the inverse f-!: (0, 1) > R' given by 
| 
—(y) = lo (- — 7 ; 
ye a, ey 


It is clear that, if f has an inverse, then f—! has an inverse and 
(fy =f 

Inverse images. Let f: X — Y be an arbitrary function and let E 
be any subset of Y. We denote by f-1(£) the set 

{x € X| f(x) € E}. 

This set is called the inverse image of E. It is important to note that 
the definition of f-'(Z) does not presuppose the existence of the 
inverse function f-!. However, when f~ does exist, then f—1(£) is the 
image of F under f™. 

If f is the function in (i) on p. 7 then, for example, 


tO, 1}) ee (= L, 1); 
the function f—1 does not exist. 


Composition of functions. Given the functions f: X > Y and 
g: Y > W, the function h: X > W defined by 


A(x) = gtf(x)} (xe X) 
is called the composition of g and f and is written go f. 
For three functions f,: X¥, > Xo, fo: X2 > X3, fg:X3 > Xa, we 
clearly have fro (hoh) = (hohoh 


so that the expression f,0 f,0 f, has a meaning. The process of com- 
position may be extended to any number of functions. 
For a non-empty set A, denote by i, the identity function on A 


defined by Lx) = x (XE A). 
If, now, f: X > Y is any function, 

foix =f, ipof=f; 
and, when / is bijective, 


flof=ix, fof = iy. 


io FUNCTIONS 9 


Theorem 1.31. 

(i) If the function f: X > Y is surjective and if there is a function 
g: YX such that gof = ix, then f is bijective and 2. f. 

(ii) If the function f: X > Y is bijective and if the function g: Y > X 
is such that fog = iy, theng =f: 

Proof.* 

(i) If f(x) = f(%), then 

x = gif(x)} = s{f(x)} = x 
and so f is injective as well as surjective. We now have 
g = goiy = go(fof) = (gofjoft = ixof? =f". 
(ii) g = ixog = (flof)og =f10(fog) =froiy =f". | 


In theorem 1.31 (ii) the bijectiveness of f is not a consequence of 
surjectiveness, as it is in part (i). For instance let X = {a, b} (a = 4), 
Y = {c} and let f, g be defined by f(a) = f(4) = c, g(c) = a. Then f 
is surjective and fo g = iy, but fis not bijective. 

If the functions f: ¥ > Y, g: Y> W are bijective, then go/f: 
X > W is bijective. It follows from theorem 1.31, or it may be 
proved directly, that (go fy = f-t0 g71. 

Restriction and extension of functions. Suppose that X, > X, and 
that f: X, > Y, g: X,— Y are functions such that f(x) = g(x) for 
all x € X,. We then say that g is the restriction of fto X, and that fis 
an extension of g to X;. For example let f,: R! > R' and f,: R' > Rt 
ys Aye lal ee RD, 

The function g: [0, «) > R! defined by 
x= x (x > 0) 
is the restriction to [0, 0) of both ff and fZ; and fi, f2 are both 
extensions of g to R’. 
Exercises 1 (5) 
1. Show that {{a}, {a, b}} = {{c}, {c, d} if and only ifa = cand db = d. 
2. Prove that 
(i) (AU B)xC = (Ax C)uU (Bx ©), 
(ii) (AN B)x C = (Ax C)n(BxO), 
(iii) (A—B)x C = (Ax C)-(Bx CC). 
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3. Prove that Ax B = Bx A if and only if A = B. Is (Ax A)x A necessarily the 
same set as A x (Ax A)? 


4. Show that {a} x {a} = {{{a}}}. 
5. Let @ be a collection of subsets of a set X. Prove that, for any function 
f:X—> Y, 
@fCU A= U f(A), Gi) ACN AE N F(A). 
AEG AEG AEG AEG 


Show that, if fis injective, then identity holds in (ii); and that fis injective if 
f(A B) = f(A) n f(B) 
for all subsets A, B of X. 


6. Show that, if f: X¥ > Yis any function, then 


f(A— B) > f(A)—-f(B) 
for all subsets A, B of X; and that identity holds for all A, B if and only if fis 
injective. 
7. Let @ be a collection of subsets of a set Y. Show that, for any function 
f: XY, 


@ft(U D= Vf), @Mfr ON BD= Nf"? @. 
Be€ Be@ Be@ Be@ 
8. Show that, if f: X > Y is any function, then 


f-(C— D) = f-(C)—f-(D) 
for all subsets C, D of Y. 


9. Show that, if f: XY > Y is any function, then 


(i) f7(f(A)) > A, Gi) FFB) < B 


for all subsets A of X and all subsets B of Y. Prove also that in (i) identity holds 
for all A if and only if fis injective; and in (ii) identity holds for all B if and only 
if fis surjective. 


10. The functions f,: X, > X2, fp: X, > Xe, fg: X3—> X, are such that the 
compositions f,0 f,: X; > X3 and f,0 fy: X_ > X, are both bijective. Show that 
fis So, fz are all bijective. 


11. Let R bea relation from X to X (i.e. a subset of Xx X) and write xRy when 
(x, y) € R. The relation is said to be 
(i) reflexive if xe X > xRx; 

(ii) symmetric if xRy = yRx; 

(iii) transitive if xRy and yRz > xRz. 

Criticize the following ‘argument’. The relation R is known to be symmetric 
and transitive. Writing x for z in (iii) and using (ii) we obtain xRy > xRx. Hence 
R is reflexive. 


1.4. Similarity of sets 


If there is a one-to-one correspondence between two sets A, B, 
i.e. if there exists a bijection f: A > B, then the two sets are said to 
be similar and we write Ao 
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It is easy to see that similarity has the following three properties: 
(i) A ~ A (reflexivity); ; 

(1) if A ~ B, then B ~ A (symmetry); 

(iii) if A ~ Band B ~ C, then A ~ C (transitivity). 

If we confine ourselves to a given collection @ of sets, then simi- 
larity is a relation from @ to @ and, in view of (i)-(ili) above, it is an 
equivalence relation. The reader has probably met such relations in 
other branches of mathematics, particularly algebra. 

Finite sets are similar if and only if they have the same number of 
elements. This implies, in particular, that a finite set cannot be 
similar to a proper subset of itself. We shall see that infinite sets 
always contain proper subsets to which they are similar (theorem 1.42, 
corollary). A simple example is the following. Let A be the set of all 
integers and let B be the set of all even integers. Then A ~ B, since 
the function given by y = 2x (xeE A, y € B) establishes the required 
one-to-one correspondence. 

A set which is similar to the set {1, 2, ...! of positive integers is said 
to be countably (or enumerably) infinite. Such a set is therefore one 
that can be arranged as a sequence. A set is called countable if it is 
either finite or countably infinite. 

We now show that there are infinite sets which are not countable. 


Theorem 1.41. The set of real numbers in the interval (0, 1) is not 
countable. 


Proof. Suppose that the real numbers in the interval (0,1) do 
form a countable set. This set is certainly infinite and can therefore 
be arranged as a sequence 

Qi, Qo, Az, ecce 
Now represent each number a,, as an infinite decimal 
B= Oe Cian oe 
in which recurring 9’s are not used. Such a representation is unique. 


Define F ( when «,,, + 1, 


Then SO Pi Bo. 


is a real number between 0 and | which differs from a, in the nth 
decimal place. Hence 5 is not one of the numbers dj, a., a3, ... and 
this contradicts the original assumption that all real numbers in the 
interval (0, 1) appear in the sequence (a,). | 


2--when.-¢,,. = i. 
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Countably infinite sets are the ‘smallest’ infinite sets in the sense 
of the theorem below. 


Theorem 1.42. Every infinite set contains a countably infinite subset. 


Proof. Let A be an infinite set. Take some element of A and call 
it a,. The set A —{a,} is not empty; denote one of its elements by ay. 
This process may be continued indefinitely: at the mth stage the set 


A —{a, 00g is 


cannot be empty since A is infinite. The set {a,, ag, ...} is a countably 
infinite subset of A. | 


Corollary. Every infinite set contains a proper subset to which it is 
similar. 


Proof. Let A be an infinite set and let S = {a,, a3, ...} be a count- 
ably infinite subset of A. The function f: A > A—{a,} defined by 


[Ont i X= ein = 1,2...) 
x if xen S 
is a bijection. Thus A ~ A—{a,}. | 


I(x) = 


The next theorem is frequently used. 


Theorem 1.43, Every subset of a countable set is countable. 


Proof. Let A be a countable set and let B be a subset of A. When B 
is finite there is nothing to prove. Assume therefore that B is infinite, 
so that A is infinite and, being countable, can be arranged as a 
sequence (a,). Since every set of positive integers has a least member, 
there is an element a,, of B with least suffix n,. Next let a,, be the 
element of B—{a,,} with least suffix n,. The procedure may be 
repeated indefinitely since B is not finite. The sequence 


Gees Wao: (1.41) 


consists of elements of B and, in fact, exhausts B. For take any element 
of B. It is of the form a;,, since it is a member of A, and it must there- 
fore occur among the first k elements of the sequence (1.41). Hence B 
is countable. | 


Theorem 1.44. The set P? of all ordered pairs (p, q) of positive integers 
is countable. 
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Proof. The elements of P?, all of which appear in the array 
a 7 
Ge te ys ks 
2 OD, 


may be arranged as a sequence in several simple ways. Perhaps the 
simplest is the enumeration by diagonals: 


(i, 15 te G1). @, 2), 1.3; 
i) 4a = Le) a Os ed 


Exercise. Show that the set P” (n > 2) of all ordered sets (pj, ..., p,) of m positive 
integers is countable. 


Theorem 1.45. The union of a countable collection of countable sets 
is countable. 


Proof. Let {A,, Ag, ...} be a finite or countably infinite collection 
of countable sets. If 


n—1 
B, = Ay, B,, = ye U A, (n > 1), 
k=1 
then the B,, are mutually disjoint and 
say. (The non-committal notation U is used since the number of sets 


may be finite or infinite.) By theorem 1.43 each of the sets B, is 
countable. Therefore, if B,, is not empty, its elements may be arranged 
in a possibly terminating sequence 


Bia, ‘eee eoee 


Now let S be the subset of P? which contains a pair (p, q) if and only 
if B,, is either infinite or has at least g members. By theorems 1.43 and 
1.44, S is countable. Since there is an obvious one-to-one correspon- 
dence between S and C, it follows that C is countable. | 


Theorem 1.46. 


(i) The set of all rational numbers is countable. 
(ii) The set of all irrational numbers is not countable. 
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Proof. A given positive rational number is uniquely representable 
in the form p/g, where p,q are coprime positive integers. With this 
number we now associate the ordered pair (p,q). Thus the set of 
positive rational numbers is similar to a subset of P? and is therefore 
countable. Part (i) now follows easily by use of theorem 1.45. 

Since the set of real numbers in (0, 1) is not countable, by theorem 
1.43 the set R! of all real numbers cannot be countable. If now the set 
of irrational numbers were countable, (i) and theorem 1.45 would 
show R} to be countable. | 


Instead of considering all rational and irrational numbers we may 
restrict ourselves to those in any given interval. It follows immediately 
from theorem 1.43 that the analogue of (i) holds. The analogue of 
(ii) can then be proved as before by using exercise 2 below. 


Exercises 1(c) 
1. Let C be a countable set. Show that, if A is any infinite set, then 


AUC~ A; 
and that, if B is an uncountable set, then 
B-Cw~ B. 


2. (i) Show that all finite open intervals in R‘ are similar. 
Gi) Show that all open intervals in R! are similar. 
(iii) Show that all intervals in R* are similar. 


3. Let @ be a countable collection of disjoint sets such that, for all 4€@%, 
A ~ R’. Show that co 


AcE 


4. Show that, if the sets A,,..., A, are countable, then so is A, x...x A,. (In 
particular, if Q is the set of rational numbers, then Q” is countable.) 


5. Let P be the set of all sequences of 0’s and 1’s. Using the binary representation 
of real numbers prove that F is similar to the interval (0, 1). 

Show also that A? ~ FY and deduce that, if the sets A,, ..., A, are similar to 
R}, then A, x...x A, ~ R'. Un particular, R” ~ R'.) 


6. Prove that the set of all polynomials 
Ag X™ + ay xP t+... + An1X +n 


with integral coefficients is countable. Deduce that the set of algebraic numbers 
is countable. (An algebraic number is a number which is a root of an algebraic 
equation with integral coefficients.) 


7. Let R;(0 < i < 9) be the set of those real numbers in (0, 1) which do not use 
the numeral 7 in their decimal representation. Prove that R; ~ (0, 1). (Suppose 
that recurring 9’s are not used.) 
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8. A set E of real numbers is such that every series 
fea) 
Di ns 
n=1 
whose terms are distinct elements of E, converges. Show that E is countable. 


9. Given a set A, denote by “4 the set of subsets of A (including @ and 4A). 
Show that ; 
(i) if A has n elements, then “4 has 2” elements; 
(ii) if A is countably infinite, then Ws ~ R'; 
(iii) if A is any set, then A 4+ %4 (though clearly A is similar to a subset of 
Fa): 


10. Show that any collection of disjoint open intervals of R* is countable. 
Deduce that, if J is any interval in R! and the function f: J > R! is monotonic, 
then the set of points of discontinuity of fis countable. 

(if c €J and c is not an end point, then 


f(e+) = lim f(x), f(c—) = lim f(x) 


OY Daze Olen vs Z2>Ce— 
exist; see Cl, 32.) 


NOTES ON CHAPTER 1 


§ 1.1. Axiomatic set theory. The intuitive notion of a set, though perfectly 
adequate for everyday use by analysts, is, in fact, self-contradictory. To demon- 
strate this we describe the famous Russell paradox. A set may or may not be a 
member of itself: most sets are not and we call these normal; an example of a set 
which is a member of itself is the set consisting of all infinite sets (for clearly 
there are infinitely many of these). Consider the set N of all normal sets. Is N 
normal or not? If N is normal, then, by definition of normality, N ¢ N; but, by 
definition of N, a set which does not belong to N is not normal and we have a 
contradiction. The supposition that N is not normal is equally untenable. The 
mathematician’s way out of this dilemma is to construct an axiomatic system 
designed to mimic the desirable features of the intuitive approach and to avoid 
its pitfalls. The axiomatic theory of sets is described in Theory of Sets and 
Transfinite Numbers by B. Rotman and G. T. Kneebone. A more informal atti- 
tude is adopted by P. R. Halmos in Naive Set Theory. 

The theory of sets is the starting point of the monumental Eléments de 
Mathématique, which is designed to present the whole of pure mathematics in 
strictly logical order. This work, by a group of French mathematicians writing 
under the collective name of N. Bourbaki, has been appearing in sections since 
1939 and is not yet (in 1969) complete. 

The real number system. The set R? of real numbers, together with the operations 
of addition and multiplication and an ordering relation, is an ordered field: it 
satisfies the following six algebraic and three ordinal axioms. 

Al. To every pair of real numbers a, b correspond a real number a+b, the 
sum of a and b, and a real number ab, the product of a and b. 

A2. a+b = b+a; ab = ba. 

A3. (a+b)+c = a+(b+c); (ab)e = abe). 

A4. There are real numbers 0, 1 such that 


O+a=a and la=a 
for all real numbers a. 
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A5. For every real number a, there is a real number x such that a+ x = 0. 

For every real number a, other than 0, there is a real number y such that 
ay = 1, 

A6. (a+ b)c = act+hbe. 

O1. For every two real numbers a, b, one and only one of 


@>oby ta by 8. 8 
is true. 

O2. ifa > band b > c; thena > c. 

O3. Ifa > b, then a+c > b+c; if also c > O, then ac > be. 

What distinguishes the real number system from other ordered fields is 
Dedekind’s axiom: 

If L, R are two non-empty sets of real numbers such that LU R = R°’ and 
every member of L is less than every member of R, then either L has a largest 
member or R has a least member. 

The construction of such a richly endowed system is, not surprisingly, a major 
undertaking. The first step is to obtain the natural numbers 0, 1, 2, ...; and the 
most satisfying way of doing this is to use only the machinery of set theory (see, 
for instance, Halmos’s Naive Set Theory). Another possibility is to begin with 
the set of natural numbers, but to assume no more than a few simple properties 
(Peano’s axioms). This approach is adopted by L. W. Cohen and G. Ehrlich 
(The Real Number System), E. Landau (Foundations of Analysis) and H. A. 
Thurston (The Number System). There are now standard algebraic methods for 
proceeding first to the integers (0, +1, +2, ...) and then to the rational numbers. 
At this stage we have arrived at an ordered field which does not, however, satisfy 
Dedekind’s axiom. There are two principal constructions for the final step to 
the real numbers. The method of cuts is mentioned in C1 and is given in detail 
by Landau. Cauchy (or fundamental) sequences are used by Cohen and Ehrlich 
and by Thurston. This process is also described in the notes at the end of 
chapter 3, since it is closely related to some of the contents of that chapter. 

An ordered field is said to be Archimedean if, given any member of the field, 
there exists a larger integral member; an equivalent property is that between any 
two members of the field there is a rational field element. The real number system 
is of this kind, as is shown in (6) on p. 73. 


2 
METRIC SPACES 


2.1. Metrics 


Classical analysis deals with sets of points in and functions defined 
on Euclidean space or the complex plane, while in more recent 
developments prominent parts are played also by other types of 
spaces. These spaces lead to theories with many similar features. The 
reason is that the spaces share the same underlying structure. It is 
therefore both illuminating and also economical of effort to develop 
many aspects of analysis in a general setting which includes as special 
cases the various spaces of particular importance. A fundamental 
property which these spaces have in common is that in all of them 
there is a distance between any two points. We now lay down the 
conditions which such a distance function or metric must satisfy. 


Definition. Suppose that X is a (non-empty) set and that p is a real 
valued function on X x X with the following three properties. 

M1. p(x, y) 2 0 for all x,yeX and p(x, y) = 0 if and only if 
x=; 

M2. p(x, y) = p(y, x) for all x, ye X; 

M3. p(x, z) < p(x, y)+p(y, 2) for all x,y,zEX (the triangle 
inequality). 

The function p is called a metric or distance on X; and X, taken 
together with the metric p, is called a metric space which we denote 
by (X, p). 

The notation (X, p) emphasizes the fact that the set X¥ and the 
metric p have equal shares in the construction of the metric space. 
A given set may give rise to many different metric spaces by having 
different metrics associated with it. Nevertheless, when it is clear 
which metric is being used we may, for brevity, write X rather than 
(X, e) for the metric space. 


Illustrations 
(i) Any set X can be equipped with a metric; 
OAT ey, 
db eal 2 
pl, y) {1 ii x + y 


is such a metric, but not a very interesting one. (This metric is called the discrete 
metric.) 
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(ii) On R! the usual metric is given by 
p(x, y) = |x—-yI. 
The usual metric on R” is defined as follows. If 
x= Wy, 5% = Geo Vee, 
px, ¥) = MO yh +... + n= Yn)"3- (2.11) 


The case n = 1 gives the earlier definition of the usual metric on R’. 
Plainly p satisfies M1 and M2. Putting a; = x;—y:, b; = y;—Z,; we see that 
M3 is equivalent to 


n + n $ n $ 
(= (a+b) < & at) +(3 ei) (2.12) 
i=1 i=1 t=] 
By squaring both sides of (2.12) we obtain yet another equivalent form, namely 
n n oe $ 
i=1 i=1 i=1 
This now follows from the fact that, for all real &, 
n 
(a,6+5,;)? = Ag? +2HE+B => 0, 
i=1 
so that H? < AB, 
n n n 
where A= > @, 2 = 2 Ge Pe Bo, 
i=1 i=1 j= 1 


Therefore p, defined by (2.11) is a metric. 
The metric space (R”, p), where p is the usual metric, is called n-dimensional 
Euclidean space. 
(iii) On Z, the complex plane, the usual metric is of the same form as that 
on R?. If z = x+iy and z’ = x’+ iy’ (x, y, x’, y’ real), 
p(z, z’) = |z—z'| = Vx—-x’P + O—-y')"}. 


In fact, when equipped with their usual metrics, Z and R? are essentially the same 
metric spaces in which only the notations differ. 
(iv) Let Bla, b] be the set of bounded real functions on the interval [a, 5] and, 


for f, g € Bla, 5), let pf, g) = ee f(x) -—g(x)|. 


Clearly p has the properties M1 and M2. M3 follows from the inequalities 
sup |A(x) + ¥(x)| < sup (|A@)|+ |¥@)|) < sup |A@)|+sup |¥@)]. 


(v) Let S be the space of sequences x = (Xj, X2, ...) of complex numbers such 
that Xx, converges absolutely. For x, y¢ S put 


ce) 
P(x, y) = > |Xn—Ynl- 


n=1 


It is easy to show that p is a metric on S. 


Let (X, p) be a metric space and let Y be a subset of X. If o is the 
restriction of p to Y~x Y, 1.e. o(x, y) = p(x, y) for all x, ye Y, then 
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o is a metric on Y; it is said to be the metric induced by (X, p) on Y. 
The metric space (Y, o) is called a (metric) subspace of (X, p). For 
instance let R® have its usual metric p given by 


p(x, ¥) = V{1— yy)? + %2—Y2)? + (%3—Ya)"}- 
If Y is the set of points x € R* with x, = c, the metric o induced on 
Vis given by a(x, y) = V{— yi)? + 2a). 


In future, unless there is a statement to the contrary, the metrics 
on R” and Z will be taken to be the usual metrics; and subsets of R” 
and Z will be taken to have the metrics induced by the usual ones. 


Sequences of points. The following definition extends the notion of 
convergence of sequences of real numbers to sequences of points in 
a metric space. 


Definition. The sequence (x,,) of points in the metric space (X, p) is 
said to converge to the point x(e X) if 


P(Xn, xX) >O0 as n>, (2.14) 


We write, as in classical analysis, X, > xX asn—>o or lim x, = x. 


n—>o 


Since p(x, y) is a real number, we are able to use the concept of 
convergence in R!. We could replace (2.14) by the phrase ‘given 
€ > 0, there is an m, such that p(x,, x) < ¢ for n > n)’. This form 
of the definition reduces to the familiar definition of a convergent 
sequence of real numbers when (X, p) is R! with its usual metric. 


Exercise. Prove that a sequence cannot converge to two distinct limits. 


Illustrations 
n 2n? 
i I R? ] t eee Pea eer See RS 
ee a = Os =a 
Then, as 1 > 00, X_ >, 2), 


; 1 \2 4 2 
since P(Xn, x) = J (gs) + (+5) > 0. 
(ii) In Bl, 2] with p(f,g) = sup |f(x)—g(x)|, let f, be given by 
1<x<2 


if = tity". OC <%= 2). 
Then aoe 3 


where Jah ee xf) = x = 2). 
For, when 1 < x < 2, 


0 < f,(x)—f(x) = x07" +:1)""-— x << xQ2""-1) < 2Q1?”"-1) 
and so Phat) < 2Q2%"-1) 70 as n>, 


20 METRIC SPACES [2.1 


When the metrics p, 7 on a set X are such that, as n > ©, x, > x 
in (X, p) if and only if x,, > x in (X, o), then p and o@ are said to be 
equivalent metrics. We shall see, as the theory develops, that a change 
to an equivalent metric preserves many of the properties of a metric 
space. | 

A sufficient, though not necessary condition for the metrics p, 7 
to be equivalent is the existence of strictly positive constants A, 
seen Ana, ») < oC) < Mo») 


for all x, y e X. (See exercise 2(a), 7.) 


Exercises 2 (a) 


1. Is the function p given by p(x, y) = |x?—y?| a metric on (a) (—2, 0), 
(b) [0, 00)? 


2. Let C be the set of bounded and continuous real functions on R? and let p be 


defined by ox 
[ {6(t)—(0)} dt 


P(e, Y) = —— 


0<h<l1 


Show that p is a metric on C. 


3. Let X be a non-empty set and let p be a real-valued function on Xx X such 
that 


(i) p(x, y) = O if and only if x = y; and 
(ii) p(x, y) < p(x, z)+ PQ, z) for all x, y, z€ X. 


Prove that p is a metric on X. 
4. In R® let x, = (611, .--5 nz). Show that 
Xp X > tied 
as k > 00 if and only if, for 1 < i < n, i > &. 
5, In a metric space (X, p), Xn > x and y, > y as n > ©. Prove that 
P(Xns Yn) > P(X, ¥)- 
6. Show that, if p is a metric on X, then so is o given by 


p(x, y) 


ies 1+ p(x, y) 


and that p, o are equivalent metrics. 


7. Prove that the metrics p,o on X are equivalent if there are constants 


A, w > 0 such that : 
x Ap(x, y) < a(x, y) < mp(x, y) 


for all x, y ¢ X. Give an example to show that the converse is false. 
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8. On R?, o and 7 are defined by 
o(x, y) = max (|x, — il, |x2—-y2|), 7) = [x1 —- Yi] + |%2— yal. 


Show that o and 7 are metrics which are equivalent to one another and to the 
usual metric p. 

Compare the sets of points x in R? given by p(x, a) < 1, o(x, a) < 1,7(x,a) < 1 
respectively. 


9. Denote by a the discrete metric on R! (p. 17, illustration (i)). Prove that o and 
the usual metric p on R' are not equivalent. 


10. Let C be the set of real functions continuous on the interval [0, 1] and define 


o. Eo 
O0<ax<l 


/ il $ 
o(f, g) = (| If) —2e()|? ax) : 


1 
ae ede 


(A function continuous on a closed interval is bounded and integrable.) Prove 
that o and 7 are metrics on C. Show also that, for all f ge C, 


PS, g) = ofa) 2 TY, 8). 
For n = 1,2, ..., the functions f,, g, € C are defined by 
l—-nx O<x<n°, n(i—nx) (0< x <n), 
fx) = ErlX) = 
0 firt ty = 1); 0 (nz x=). 
Prove that the sequence (/,,) converges in (C, o) and in (C, 7), but not in (C, p); 
and that (g,,) converges in (C, 7), but not in (C, p) or in (C, 0). 


11. Let (X,, p,), .--; (Xns Pn) be any metric spaces. Verify that metrics on 
X1x ... x X, are given by 


a(x, y) = max {P,(X1, 1), --+» Pn(Xns Yn)}s 
T(x, y) = Mex, yi) +— oes + p?(Xns Vadis 
TAX, Y) = Pil%1, V+... + Pn(Xns Yn)» 


where x = (Xj, ---5 Xn)> ¥ = (1, «+s Yn) and x;, ¥, € X; Gi = 1, ..., n). Show also 
that 01, 02, 7; are all equivalent. 


2.2. Norms 


The theory of vector spaces is partly algebraic and partly analytical. 
We need only the most basic algebraic ideas and these are, no doubt, 
known to the reader. But, largely to fix notation and terminology, 
we formally define a vector space. 


Definition. Let V be a non-empty set and suppose that (i) to any two 
elements x, y of V there corresponds an element in V, called their sum, 
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which is denoted by x+y; and (ii) to any real number a and any x in V 
there corresponds an element in V, called the scalar product of « and 
x, which is denoted by ax. (Thus addition is a function on Vx V into 
V and scalar multiplication is a function on R!x V into V.) Suppose 
also that the following conditions are satisfied. 

C1) x+y = yt+x; 

(2) x+(y+z) = (x+y)+2z; 

(3) there is an element 0 & V (the zero element) such that x+0 = x 
for allx eV; 

(4) to every element x € V there corresponds an element —xéV 
such that x+(—x) = 90; 

(5) a(6x) = (a@f)x; 

(@) ix =x; 

(7) («+ f)x = ax+fx; 

(8) a(x+y) = ax+ay. 

Then V (together with the operations of addition and scalar multi- 
plication defined in V) is called a vector (or linear) space over R}, or a 
real vector space. 

If the scalar product is formed with complex numbers « and the same 
properties (1)-(8) hold, then V is said to be a vector space over Z, or a 
complex vector space. 

The simplest example of a real vector space is R” with addition 
and scalar multiplication defined by 


(x1, cers Xe) FAs eeeg Yn) $563 (x1, +1, eceg Xa tek 
WN, 25 Mok A 


A more sophisticated example is Bla, b]; here f+ g and af are defined 


by 
(ft+e)) =fx+ex), PmM=fX) (<x Jb). 


We shall assume the elementary algebraic properties of vector 
spaces. Some of these are given in exercise 2(b), 1. 


Definition. Let V be a vector space. The real valued function || .|| on V 
is called a norm on V, and V is called a normed vector space, if 
N1. |x|] > 0 for all x € V and ||x\|| = 0 if and only if x = 6; 
N2. |jx-+yl] < |x| + [yl for all x, y € V: 
N3. ||ax|] = |c|||x|| for every number « and all x € V. 
It is easy to see that, in a normed vector space, a metric p may be 


defined by p(x, y) = |x—y]. 
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In fact, the metrics in illustrations (ii)-(v) of §2.1 all arise from 
norms. For instance in R” the usual norm is given by 


x] = Git... +27) 
and in Bla, 5] If] = sup [0]. 


However, a metric on a vector space need not be derivable from a 
norm (see exercise 2(5), 2). 

More specialized than normed vector spaces are inner product 
spaces. 


Definition. Let V be a complex normed vector space and suppose that, 
with every ordered pair (x, y) of elements in V there is associated a 
complex number, denoted by x.y, and that the following conditions are 
satisfied. 

Il. x.y = y.x for all x,yeV; 

12. (ax+fy).z = a(x.z)+f(y.z) for all numbers a, P and all 
x, V,ZEV; 

13. x.x = |x|? for all xe V. 

Then x.y is called the inner product of x and y and V is called a 
complex inner product space. 

If V is areal normed vector space, x.y is real and 11-13 hold, then V 
is called a real inner product space. Note that, in this case, I1 reduces 
tox.y=y.xX. 

It is shown in exercise 2(b), 4 that no more than one inner product 
can be associated with a given norm. 

Once again R” provides the most obvious illustration. The inner 
product for the usual norm is defined by 


X.Y — XyVy+ bisce ee 
The inequality (2.13) can now be written as 


x.y < [al bl 


and is a particular case of the following result. 


Theorem 2.21. (Cauchy’s inequality.) If V is a real or complex inner 
product space, then, for all x, y € V, 


Ix-y| < ||| [yl 
Equality holds if and only if one of x, y isa scalar multiple of the other. 
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Proof. Since 0.y = O (see exercise 2(5), 6), we may assume that 
x 0. Take i y.x 


Il 


Then 0 < [fx+yll? = (Ax+y).(2x+y) 
EP |xl?+A@.9+2O-9+I0 2 


= Tae j=12—2 FE + Dol 


Moreover ||fx+ yl] = 0, ie. Ax+y = 9, if and only if y = ax for 
some number a. | 


Exercises 2 (6) 


1. Show that the following identities hold in a vector space: 
(i) Ox = 6; (ii) (-—1)x = —x; (iii) «0 = 0; 
(iv) if x—y denotes x+(—y), then 


—(x+y) = —x-y,  —(&—y) = yx. 


2. Prove that the discrete metric o on R? (see illustration (i) of § 2.1) cannot be 
derived from a norm. 


3. Two norms on a vector space V are equivalent if they give rise to equivalent 
metrics. Prove that the norms ||.||, and ||.||, are equivalent if and only if there 
exist strictly positive constants A, ~ such that 


Alxl], < xl. < #lxh, 
for all x € V. (Contrast exercise 2(a), 7.) 


4. Show that if, in a normed vector space, an inner product exists, then it is 
unique. 


5. The space T consists of the sequences x = (X, X2, ...) of complex numbers 
such that &|x,,|? converges. Verify that p given by 


00 $ 
p(x, y) = ( x nal) (x, y€T) 


is a metric on 7; that the metric arises from a norm; and that to this norm there 
corresponds the inner product 


x.y= > Xa Vex 
n=1 
6. Show that in an inner product space x.0=0.x=0. 
7. Prove that, if V is an inner product space, then, for all x, yé€ V, 
+ y|?+ |xayl? = 21x]? + [y/?). 


Hence give an example of a normed vector space which is not an inner product 
space. 
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8. The sequences (x,), (yn) in the normed vector space V converge to x,y 
respectively, and the numerical sequences («,), (Zn) converge to «, # respectively. 


Show that 
An Xn + aa > AX+ Py. 


Prove also that, if V possesses an inner product, then 
Xn-YVn >X.). 


9. Alternative definition of inner product space. Let V be a complex (real) vector 
space and suppose that, with every ordered pair (x, y) of elements in V, there is 
associated a complex (real) number x.y such that I1, [2 are satisfied and 

13’. for all xe V, x.x > O and x.x = 0 if and only if x = @. 

Show that ||x|| = /(x.x) is a norm on V. 


2.3. Open and closed sets 


In this section we consider subsets of a fixed metric space CX, p). 
We shall use the symbol X for the metric space as well as the set 
without reiterating that p is specified. 

Let ae X and let r be a positive real number. The set of points x 
such that p(a, x) < ris called the open ball with centre a and radius r; 
we shall denote it by B(a;r). In R', B(a;r) is the open interval 
(a—r, a+r). In R? and in Z an open ball is usually called an open 
disc. 


Definition. Given a subset E of X (which may be X itself), the point 
ce X is said to be a limit point of E if, for every ¢ > 0, there is an 
x € E such that 0 < p(c,x) <6, 


i.e. B(c; €)—{c} contains a point of E. 

A limit point c of E can also be defined by either of the following 
two conditions: 

(i) Every open ball B(c; €) contains infinitely many points of E. 

(ii) There is a sequence of points x, ¢ £ such that x, + c and 
Xn, >casn—>o., 

It is clear that a point c with properties (i) or (ii) is a limit point of E. 
Conversely, if c is a limit point of E, then, for every n, there is a point 
X, € E such that 0 < p(c,x,) < 1fn 


and so (ii) is satisfied. Also (ii) implies (i). 

The conditions (i) or (ii) show that a finite set cannot have a limit 
point. An infinite set may or may not have a limit point. For example 
the subset {1, 2, 3, ...} of R* has no limit point. 

It is important to note that a limit point of a set E may or may not 
belong to £. For instance if, in R', E is the interval (0, 1), then every 
point in the interval [0, 1] is a limit point of E. 
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A point c of E is said to be an isolated point of E if it is not a limit 
point of E. Thus, if c is an isolated point of EF, there is a d > 0 such 
that B(c; 6) contains no point of £ other than c. 


Definition. Given a set E < X, the point c € E is said to be an interior 
point of E if there isa 6 > 0 such that B(c; 9) © E. 


Illustrations 


(i) All points of an open ball B(a; r) are interior points of B(a; r). For suppose 
that c € B(a; r), so that p(a, c) = s < r. If 6 = r—s and p(c, x) < 40, then 


pla, x) < p(a,oc)+p(c, x) << std=r 
and so x€ B(a;r). Therefore B(c; 5) < B(a;r), i.e. c is an interior point of 


Gi) Xx =-Rt. 

Let E = [0, 1). The set of limit points of E is [0,1]; the set of interior points 
of E is (QO, 1). 

Let E be the set of rational points of R1. Every point of R? is a limit point of E. 
E has no interior points. 

tii). X= Z. 

Let E be the annulus consisting of the points z such that 1 < |z| < 2. The set 
of limit points is the set of z such that 1 < |z| < 2. The set of interior points 
is E. 

Let E be the set of points x+iy such that y = —1 and0 < x < 1. The set of 
limit points is E. There are no interior points. 


When the space X has no isolated points (as is the case with R” 
and Z) then an interior point of a subset FE of X is also a limit point 
of £. But if X is, for instance, the space consisting of all the integers 
(and p is the usual metric in R4), then every subset E of X is without 
limit points although all the points of E are interior points of E£. 


Definition. (i) A set F in X is said to be closed if F contains all its 
limit points. 

(ii) A set G in X is said to be open if every point of G is an interior 
point of G. 


Illustrations 


(i) Every open ball in a metric space is open. A set {x|p(a, x) < r} is closed 
(see exercise 2(c), 2) and is consequently called the closed ball with centre a and 
radius r. 

Any set without limit points is closed; in particular, every finite set is closed. 

(ii) In R”, every open interval is open and every closed interval is closed. 

(iii) In R', the interval [0, 1) is neither open nor closed; the set of rational 
points is neither open nor closed. 

(iv) In Z, the annulus defined by 1 < |z| < 2 is open; the real axis (im z = 0) 
is closed; the set of points z = (1+i)/n(m = 1, 2, ...) is neither open nor closed. 
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(v) Let (C[a, b], p) be the metric space of real functions continuous ape sO 
bounded) on the interval [a, 5], in which 


PL g) = Boe [f(x) -—g(x)|. 


Then the set £, of f such that 
inf f(x) >0 


axx<b 


is open and the set E, of f such that 


f@ = 
is closed. 

(vi) In any metric space the empty set and the whole space are both open and 
closed. In R” and Z these are the only sets that are both open and closed (theorem 
3.32, corollary). In an arbitrary metric space there may be others. For instance 
in the space of integers (with the usual metric of R*) every set is both open and 
closed. The question is taken up again in §3.3. 

(vii) The property of being open or closed (or neither) is not intrinsic to a 
given set E; it is relative to the set X of which E is regarded as a subset and the 
metric p with which X is endowed. For instance [0, 1) is neither open nor closed 
as a subset of R* equipped with the usual metric, but [0, 1) is closed as a subset 
of (—1, 1) with the same metric; and as a subset of (R?, ~), where o is the discrete 
metric, [0, 1) is both open and closed. 

A convenient characterization of closed sets is embodied in the 


following result. 


Theorem 2.31. A necessary and sufficient condition for the set F to be 
closed is that, whenever (x,) is a convergent sequence of points in F, 
lim x, € F. 


V ee eh ,2) 


Proof. First suppose that F is closed and that x, — x, where 
x, € Ffor every n. If x,, = x for all sufficiently large n, then, a fortiori, 
x € F; otherwise x is a limit point of F and again x e€ F. 

Next, let F have the property described in the theorem. If F has no 
limit points, then F is closed. If F has a limit point x, then there is a 
sequence (x,) of points in F such that x, > x, andsoxeF. | 


Theorem 2.32. (i) The set G in X is open if and only if G’ (= X—G) 
is closed. 
(ii) The set F in X is closed if and only if F’ (= X—F) is open. 


Proof. We need only show that (a) if G is open, then G’ is closed 
and (b) if F is closed, then F’ is open. 

(a) Let G be open. If G’ has no limit points, then G’ is closed. If G’ 
has a limit point c, then every open ball B(c; €) contains points of G’. 
Hence c cannot be an interior point of G and, since G is open, c ¢ G. 
Thus ce G’. We have therefore shown that G’ contains all its limit 
points. 


28 METRIC SPACES 12.3 


(b) Let F be closed. If F’ is empty, then F’ is open. If F’ is not 
empty, let x be any point of F’. Since F is closed and x ¢ F, it follows 
that x is not a limit point of F. Hence there 1s a B(x; 0) free of points 
of F, i.e. there is a B(x; 6) < F’. Thus the arbitrary point x of F’ is an 
interior point of F’. | 


We have stressed the fact that a given set is open or closed (or 
neither) only with respect to the metric space in which it is embedded. 
It is therefore of interest to note that, if p and o are equivalent metrics 
on X, then the metric spaces (X, p) and (X, 7) have the same open 
and the same closed sets. This follows easily from the last two 
theorems. The converse also holds. 

The next theorem concerns collections of open or closed sets. 
These collections may be finite or infinite and need not be countable. 


Theorem 2.33. (i) The union of any collection of open sets is open. 
(ii) The intersection of any collection of closed sets is closed. 


Proof. (i) Let Y be a collection of open sets G. If xe U G, then 
GegY 


there is a member of Y, say G*, such that x e G*. Since G* is open, 
x is an interior point of G*, i.e. there is a 0 such that 


Bx 6) = G* 
and so also Be: oy = v G. 


Hence x is an interior point of ae G. This shows that U G is open. 
EG 


GeGY 
(ii) Let ¥ be a collection of siByaid sets F. We could show directly 


that ( F contains all its limit points; but, to illustrate the use of 
FeF 


complements in proving ‘dual’ results for open and closed sets, we 
use (i) as follows. 
Since F’ is open, whenever Fe F, by (i), 


is open. Hence, by theorem 2.32, | Fis closed. | 
FeF 


Theorem 2.34. (i) The intersection of a finite collection of open sets 
is open. 

(ii) The union of a finite collection of closed sets is closed. 

The proof is left to the reader. 


2.3] OPEN AND CLOSED SETS 29 


Exercise. Give examples of (i) a sequence of open sets whose intersection is not 
open and (ii) a sequence of closed sets whose union is not closed. 


Let (Y, «) be a subspace of the metric space (X, p). Then (Y, 7) has 
open and closed sets and, although a set which is open (closed) in Y 
is, of course, generally not open (closed) in X, the relationship be- 
tween open and closed sets in X and in Y is very simple. 


Theorem 2.35. Let (Y, a) be a metric subspace of the metric space 
(X, p). Then a set is open in (Y, 0) if and only if it is of the form 
Y n G, where G is open in (X, p). A similar result holds for closed 
sets. 


Proof. First, if G is open in X, then clearly every point of Y n G 
is an interior point of Y n Gin Y. 

Next, let E be open in Y. Then, for every x € E we can find a 0, 
such that ye E whenever ye Y and o(x, y) < 4,. If now B(x; 4,) 
denotes an open ball in X, the set 

G= U B(x; om) 


re 
is open in X¥ and EF = Y n G. 
The result for closed sets may be deduced by the method of taking 
complements. | 


Corollary. If Y is open in X, then a subset of Y is open in Y if and only 
if it is open in X. If Y is closed in X, then a subset of Y is closed in Y 
if and only if it is closed in X. 


Definition. Let E be any subset of X. 

(i) The interior of E, denoted by E”, is the set of interior points of E. 

(ii) The closure of E, denoted by E, is the union of E and the set of 
limit points of E. 

Thus £ is the set of x such that every B(x; €) contains at least one 
point of E or, equivalently, x ¢ E if and only if x is the limit of a 
sequence of points in E£. 

For alternative definitions of the notions of interior and closure 
see exercise 2(c), 6. 


Theorem 2.36. For any set E in X, E° is open and E is closed. 


Proof. If E° is empty, it is open. If E° is not empty, let x be any 
point of E°. Then there is a B(x; 0) ¢ E. Every ye B(x; 6) is an 
interior point of B(x; 6) and so of E. Hence B(x; 6) < E° and there- 
fore x is an interior point of E°. 
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It may be shown directly that £ is closed. The result also follows 


from the identity (EY = (ey 


which is easily proved. (See exercise 2(c), 7.) | 


Open and closed sets could also have been defined in terms of 
interiors and closures: G is open if and only if G° = G; Fis closed if 
and only if F = F. 


Definition. A frontier point of a set E in X is a point c such that 
every open ball Bic; €) contains at least one point of E and at least one 
point of E’. The set of all frontier points of E is called the frontier of E 
and is denoted by fr E. 

Clearly E and E’ have the same frontier. 

Since a frontier point of E is a point of E which is a limit point of 
E’ or a point of E’ which is a limit point of EZ, it follows that 


frE = E-E’. (2.31) 
Illustrations 
(i) Let E be the interval [0, 1) in R'. Then E° = (0, 1), E = [0, 1] and frE = 
{0, 1}. 


(ii) Let CX, ) be the space of integers with the metric induced by R’. If 
E = Bic; 1) = {c}, then E° = E = E. Note that E is not the closed ball 


{x|p(x, c) < 1} = {c-—1, c, c+ 1}. 


Exercises 2(c) 


1. Show that every infinite set XY may be equipped with a metric p which is such 
that X has a limit point in CX, p). 
2. Prove that, in any metric space (X, p), the closed ball {x|p(a, x) < r}is closed. 


3. Prove that a bounded, closed set of real numbers contains its supremum and 
infimum. 


4. Let E be an arbitrary set in a metric space. Show that the set E* of limit points 
of E is closed. 


5. Show that in R” with the usual metric any collection of disjoint open sets is 
countable. Is this true for an arbitrary metric space? 


6. (i) Show that E° is the union of all open sets contained in E and so is the 
‘largest’ open set contained in E. 

(ii) Show that E is the intersection of all closed sets containing E and so is the 
‘smallest’ closed set containing E. 


7. Prove that, for any subset E of X, 

E°’ufrEvuCey = xX 
and fy = FY. 
(The set (E’)° is called the exterior of E.) 
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8. Prove that frE= EnE’ 
and that fr E is closed. 
9. (i) Prove that Eo... 82, = &, VE, 
but that, for an infinite collection & of sets E, generally the relation 
FES Ye 
Ee&€é Eeé 
only holds. 
(ii) Prove that, for any collection & of sets E, 
Pm £. 
Eeé EHeé 


Construct an example to show that identity need not hold even for a finite 
collection. 
(iii) Deduce the results for interiors corresponding to (i) and (ii). 


10. The metric spaces CX, p), (X, 7) have equivalent metrics. Show that every 
subset E of X has the same limit points and interior points in CX, p) and CX, 0). 


11. If E is a set in a metric space and the set D is such that 
Dc Ec DP, 


then D is said to be dense in E. (For instance the set of rational numbers is dense 
in R1.) Show that, if C is dense in D and D is dense in E, then C is dense in E. 


NOTES ON CHAPTER 2 


§2.1. The idea of a metric space is due to M. Fréchet. In his doctoral thesis, 
published in 1906, he examined various sets of axioms for a metric. The set 
M1— M3 is the one which experience showed to be the most fruitful. 

The inequality (2.12), which is equivalent to the triangle inequality for the 
usual metric on R”, is a particular case of Minkowski’s inequality 


n 1/r n 1/r n 1/r 
(= (a+b) & oe «) + ( p> e) 


t=1 i=1 


in which a;,b; > O(@i = 1,... m) andr > 1. (In (2.12) the a,, b; may be negative, 
but the step from non-negative a;, b; to arbitrary ones is trivial.) The inequality 
(2.13), which we call Cauchy’s inequality in theorem 2.21, is also known as 
Schwarz’s inequality (or even the Cauchy—Schwarz inequality). It is a particular 
case of Holder’s inequality. 


n n jp /n 1/q@ 
> a:b; < (> a?) (= at , 
i=] i=1 ee | 


where a;,b; > 0 (i = 1,...,”), p> 1 and p-!+q- = 1. Both Hdlder’s and 
Minkowski’s inequalities may be generalized in a variety of ways. (See Hardy, 
Littlewood and Polya, Inequalities.) 
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CONTINUOUS FUNCTIONS ON 
METRIC SPACES 


3.1. Limits 

The notion of continuity is based on that of a limit and we first 
define the limit of a function whose domain and range lie in arbitrary 
metric spaces. 


Definition. Let (X, p) and (Y, ©) be metric spaces. Let f be a function 
with domain E < X and with range in Y. Suppose also that Xp» is a 
limit point of E and y, € Y. We then say 

IDF BX Or mM TO) = 


L>2ZXq 


if, given € > 0, there is ad > 0 such that 
o(f(x), Vo) < € whenever xe E and 0 < p(x, Xo) < 4. 


Notes. (i) It is irrelevant whether x, belongs to the domain E of f 


or not. 
(ii) When X¥ = Y = R! (or Z) the above definition reduces to the 
standard definition of the limit of a real (or complex) function. 


Exercise. Show that a function cannot converge at a point to more than one 
limit. 


Illustrations 


(i) X= Y=R | 
(a) With E = RY, lim xt = x5 (k = 1,2,. = 


(6) With E = R'- 0}, — x sin(t/x) = 0, si sin (1/x) does not exist. 
0 +0 
(c) Let E be the set of “meaanel numbers aad let f be defined on E by the 


equation 
és fla) = 114 
when p, g are coprime integers and gq > 1. We shall show that, for every ce R’, 
lim f(x) = 0. 
4 Ved £4 


Given € > 0, let g) be an integer such that 1/q,. < e. Denote by Q, the set of 
rational numbers p/q with g < qo. We note that, in any finite interval, there is 
only a finite number of members of Q). Hence there is a 6 > O such that the 
intervals (c—6, c) and (c, c+9) contain no members of Q,. It follows that, 
whenever 0 < le= =—pig| <= 8, 

1 


o<s(2)-2 <2 <c 
q Q 40 
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(ii) X = R*, E = R?—{(0, 0)}, Y = R’. 


Let g, be given by . 


=. 
&1( y) x+y? 
x? 
Then x.y) = ian 
|ex¢ y)| |x| x?+y? |x| 
and lim — g,(x; y) = 0. 
(x, y)>(0, 0) 
Let g, be given b ie tee a 
et g, be given a ee 


Then lim go(x, y) does not exist. For, ify = mx, we have (for all x + 0), 
(2, y)>(0, 0) 

(x, mx) = eae = eo : 

aes x24 mx? 14+ mm?’ 


and so g, takes all values between 0 and 1 in any annulus 0 < x°+y? < 62, 
(iii) Let E = X = Y = C(O, 1] (the set of real functions continuous on (0, 1]). 
The metric p is, as is usual on C[0, 1], given by 


P(P1, 92) = , |¢.(t) — ¢,(t)|. 
The function A from C[0, 1] to C[0, 1] is given by h(¢) = ¥, where 
t 
we = | dem ar. 


If, for instance, ¢, is given by ,(t) = £2, then lim A(d) = Wo, where Y(t) = 47°. 
o>Po 
For, if p(¢o, 2) < ¢, then, forO < ¢ < 1, 


t 
31 - [6 dr| = ie — 9(7)} dr 


yo(t)-—w(t)| = 


< [ |7?— (7)|dr < € 
and therefore p(W%o, YW) < ¢. 0 
A sequence d,, dy, a3 ... may be regarded as a function on the set 
{1, 4, 4, ...}. It follows that the notion of a sequential limit may 
be expressed by means of the notion of a functional limit. What is 
less obvious, and more interesting, is that the opposite statement is 
also true. 


Theorem 3.1. Let (X,p) and (Y,o) be metric spaces. Let f be a 

function with domain E < X and with range in Y and suppose that Xo 
is a limit point of E. Then f(x) > yo aS X > Xq if and only if, for every 
sequence (X,) in E—{xo} such that X, > Xo, f(Xn) > Yo ASN > ©. 


Proof. (i) Suppose that f(x) > yo as X > Xo. Then, given e > 0, 
there is a 6 > O such that 


o( f(x), yo) < € Whenever x € E and 0 < p(x, X%) < 4. 
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Now, if (x,,) is any sequence in E— {x} such that x, > Xo, there is an 
integer m) such that 

0 =x, to) <0 fOr 8 


Therefore o({(%,); Vers € Tor w> mM, 
Le. | (Xn) >Vo AS N>O. 


(ii) Suppose that, whenever (x,) is in E— {xo} and x, > Xp, 
7 (Xn) re 

If f(x) + yo aS X > Xo, then there is an e¢ > 0 with the property 
that there is no 6 > O such that o( f(x), yo) < € whenever x € E and 
0 < p(x, X) < 6. Therefore, given any n, there is an x, € E such that 


0 < PA(Xns Xo) < I/n and O(f (Xn), Yo) 2 €, 


Thus there is a sequence (x,) in E—{x,} such that x, > x) and 
(Xn) +> Yo. This contradicts our original hypothesis and so f(x) > yo 
aS X > Xp. | 


For functions with values in a normed vector space V there is, as 
might be expected, an algebra of limits. Let f, g be such functions 
defined on a subset E of X. Also suppose that £ has a limit point Xo 
and that 
I(x) > Yo. Q(X) >% AS X>X. 


When ¢, y are real or complex valued functions on F and 


P(x) >a, Y(xX)>h as x>Xo; 


then A(x) f(x) + W(x) g(x) > 0+ Zo. (3.11) 
If V possesses an inner product, then also 

I(x).8(X) > Yo-2Z03 (3.12) 
and if Vis R' or Z, F(x) g(x) > YolZo (3.13) 


provided that Zz) + 0. 

These results may be proved directly or they may be deduced, by 
means of theorem 3.1, from the corresponding results on sequences 
(exercise 2(b), 8). 

Exercises 3(a) 
1. The real valued function fon R?—({(0, 0)} is defined by 
xy 
f(x, y) = ae 


Show that lim f(x, y) does not exist. 
(x, y)—>(0, 0) 
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2. Let (X, p), (Y, ©) be metric spaces. Let f be a function with domain E ¢ X 
and with range in Y, and suppose that x, is a limit point of E. Show that, if 


lim f(x,) exists for every sequence (x,) in E—{x,} such that x, > Xo, then 
n> 


lim f(x) exists. 
L—>2Xy 


3. Let (X, p), (Y, ©) be metric spaces, and let E be a subset of X, xo a limit point 
of E, yg a point of Y. Show that, if the function f: E— Y is such that f(x) > yo 
as X > Xo, then y, € f(Z); and that if, in addition, fis injective, then yo is a limit 
point of f(£). 


4. Let (X, p), (Y, 7), (W, 7) be metric spaces and let D, E be subsets of X, Y 
respectively. Leth = go f: D > W be the composition of the functions f:D > Y, 
where f(D) < E, and g: E> W. Also (i) xp is a limit point of D and f(x) > ¥o 
aS X > Xp; (ii) Vo is a limit point of E and g(y) > wo as y > Yo. Show that, when f 
is injective (so that, by 3, y is automatically a limit point of EZ), then h(x) > wo 
as xX > x». Construct an example in which h(x) +> wo. 


5. Prove that the statements (3.11)-(3.13) hold under the conditions given at the 
end of the section. 


3.2. Continuous functions 


The definition of continuity which we shall now give is a natural 
generalization of the concept familiar in elementary analysis. 


Definition. Let (X, p) and (Y, ©) be metric spaces. Then the function 
f: X > Yis said to be continuous at the point xy of X if, givene > Q, 
there is a 0 > O such that 


o(f(x), f(%)) < € whenever p(x, Xo) < 9 
(or f(B(%o; 8) <= BUF (%); €)). 


If f is continuous at all points of a set, it is said to be continuous on 
that set. 

The definition implies that f is continuous at a limit point x, of 
fx) > fl) a8 x > 395 
and that fis continuous at all isolated points of X (for if x, is an 
isolated point and 6 is sufficiently small, the only point x such that 
P(x, Xo) < 8 Is Xp itself). 

Let f be a function whose domain is a subset X, of a metric space 
(X, e). When considering the limits of f we are interested in the limit 
points of X, and these need not belong to X;. However, for questions 
of continuity, only the points of the domain Xj, of f are relevant. 
Since X,, endowed with the metric induced by (YX, p), is itself a 
metric space, it is therefore simplest in this context to treat X, as the 
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fundamental space. This is, in fact, what we did in the definition of 
continuity. 


Illustrations. The function f of illustration (i)(c) on p. 32 whose domain is the 
set of rational numbers, is discontinuous everywhere. If f,, with domain R°* is 


fi 
defined by f(x) when x is rational, 


fix) = 


0 when x is irrational, 


then f, is continuous at all irrational points and discontinuous at all rational 
points. | 
The function A: C[0, 1] ~ C[0, 1] of illustration (iii) on p. 33 is continuous 
on C[0, 1]. This follows from the inequality 
P(h(d,), A(P2)) < (Pr, Po) 


which holds for all $1, . of C[0, 1]. 
Let (X, p) be any metric space, let a be a fixed point of X and let the function 
g: X > R' be defined by the equation 
g(x) = p(a, x). 
Since P(a, X1)— p(a, X2) < P(%2, x1) 
and P(A, X2)— P(A, X1) < P(%1; Xa); 
lg) —2e(x2)| = |p(a, x1) — pla, X2)| < P(%, Xe) 


for all x,, x. in X. Hence g is continuous on X. 


Algebraic operations on continuous functions with values in a 
normed vector space again yield continuous functions. If the functions 
f,g:X > V and the functions ¢, ~: X > R' (or Z) are continuous 
at x), then ¢f+yeg is continuous at x); so is f.g if V is an inner 
product space, and //g is continuous at x» if Vis R* or Z and g(x9) + 0. 
All this is obvious when x, is an isolated point of X and otherwise 
follows from the corresponding results on limits. In the next theorem 
we return to functions with arbitrary ranges. The result contrasts 
with exercise 3(a), 4. 


Theorem 3.21. Let (X, p), (Y, 0), (W,7) be metric spaces and let 
h = gof: X > W be the composition of the functions f: X > Y and 
g: Y>W. If f is continuous at Xo and g is continuous at yo = f(Xo), 
then h is continuous at Xp. 


Proof. Given é, there is an 7 such that 
7(g(y), 8(0)) < € Whenever o(y, yo) < 7. 
There is now a 6 such that . 
o(f(x),f(%0)) < 7 whenever p(x, Xo) < 3 
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and so 6 is also such that 
t(h(x), h(x)) < € Whenever p(x, Xo) < 6. | 


It is important to note that the inverse function of a bijective 
continuous function is generally not continuous. A simple example 
is the function f with domain X = [0, 1) u [2, 3] and range Y = [), 2] 
defined by (x 


f(x) = 


for 0 <= x < I; 
lx-1 inl <4 46>. 


If Yand Y have the metrics induced by R’, then fis continuous on X. 
However f—, which is given by 


forO < y< Il, 
10) = . ‘ 
y+. forl < y < 2, 
is discontinuous at the point 1. For other examples see exercises 
3(b), 4 and 5. 
The concept of continuity can be formulated in a variety of ways. 
We shall next consider equivalent definitions, first for continuity at 
a point and then for continuity on the domain of the function. 


Theorem 3.22. Let (X, p) and (Y, 0) be metric spaces. A necessary 
and sufficient condition for the function f: X — Y to be continuous at 
the point Xo is that X, —> Xo implies f(Xn) >f(%). 

The proof is similar to that of theorem 3.1. The theorem may also 
be deduced from theorem 3.1, but slight complications arise, since 
members of the sequence (x,,) may now be Xp. 

Theorem 3.22 shows that the introduction of equivalent metrics 
in the domain or the range of a function does not affect the property 
of continuity: if p,p, are equivalent metrics on X and o,¢, are 
equivalent metrics on Y, and if f: X > Y is continuous at x» with 
respect to p and a, then fis also continuous at x» with respect to p, 
and 0}. 


Theorem 3.23. Let (X, p) and (Y, 0) be metric spaces. Each of the 
following conditions is necessary and sufficient for the function 
f: X = Y to be continuous on X: | | 

(i) Whenever G is open in Y, then f-\(G) is open in X. 
- (ii) Whenever F is closed in Y, then f-(F) is closed in X. 


Proof. (i) First suppose that fis continuous on X and that the set 
G is open in Y. 
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If the set f-(G) is empty then it is also open. If f-(G) is not empty, 
let x9 be any one of its points. Then f(x) € G and, since G is open, 
there is an € such that 


yeG whenever o(y, f(Xo)) < €. 
But, since fis continuous at xp, there is a 6 such that 
o(f(x), f(Xo)) < € Whenever p(x, Xo) < 0. 
Hence f(x)€G whenever p(x, Xo) < 4, 
Le. xef-(G) whenever p(x, Xo) < 6. 


This shows that x, is an interior point of f-'(G). It follows that 
f-\(G) is open in X. 
Next suppose that, whenever Gis open in Y, then f~(G) is open in X. 
Let x) be any point of XY and take any positive e. The open ball 
B(f(%); €) is open in Y and so, by hypothesis, the set f-1{B( f(x»); €)} 
is open in X. This set also contains x, and therefore there is a d > 0 


such, that x € f-{B(f(%); €)} whenever p(x, %) < 46, 
Ls. F(x) € BUf(%); €) + Whenever p(x, X) < 4, 
i.¢. o( f(x), f(%)) < € whenever p(x, Xp) < 6. 


Thus f is continuous at x, and so on X. 
(ii) This part follows from (1) by the method of complements since, 
if E is any subset of Y, 


PE) a f{E) = 9 and P(E) u fF) = X. | 


In the last theorem the inverse images may not be replaced by direct 
images. It is not true that f(S) is open (closed) in Y whenever S is open 
(closed) in X. The function defined on p. 37 will illustrate this 
(exercise 3(b), 6) but it 1s instructive to employ also more common- 
place examples. For instance the function f with domain R! and 


range R! given by f(x) = x(x—1)(x—2) 


maps the open interval (0, 1) onto (0, 3]. Next, let g, with domain 
(—0o, oo) and range [—4, 4] be defined by 


Xx 
a) = Te 


This function maps the closed interval [1, oo) on the interval (0, 4] 
which is not closed in [—4, 4]. 
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We have previously mentioned that a bijective continuous function 
ft: (X, p) > (Y, ©) need not be such that f-': (Y, 7) > CX, p) is con- 
tinuous. When f~ is continuous, f is called a homeomorphism. (Note 
that then f—! is also a homeomorphism.) Two metric spaces are said 
to be homeomorphic if there exists a homeomorphism on one into 
the other. If f:(X,p)~>(Y,¢) is. a homeomorphism, then, by 
theorem 3.23, fmaps open (closed) sets in X into open (closed) sets 
in Y and there is a bijection between the open (respectively closed) 
sets in the two spaces. 

A bijective function f: (X, p) > (Y, 7) such that 


o(f(x1), f(%2)) = PX X2) 


for all x,, x, € X is called an isometry. It is plain that an isometry 
is a homeomorphism. Two metric spaces are called isometric if there 
exists an isometry on one into the other. Such spaces have identical 
metric properties although their elements may be of entirely different 
kinds. For instance R! is isometric with the subspace A of C0, 1] 
consisting of the functions ¢, (— 0 < A < o) defined by 


o(x) = Ax (0 <x < 1). 


Another example of an isometric pair is provided by R?, Z. 


Linear functions. We briefly consider a class of functions which 
plays an important part in modern analysis. 


Definition. Let V, W be two real (complex) normed vector spaces. A 
function f: V + W is called linear if 


S (Oy Xy +L gXq) = Xf (X1) + Hef (Xe) 


for all x1, X_, € V and all real (complex) numbers &,, >. 

~ Clearly £(0) = 6, where the letter 0 on the left stands for the zero 
element in V and that on the right for the zero element in W. (It is 
not customary to use distinctive notations for the zero element and 
the norm in the two spaces.) 


Illustrations 
(1) The function f: C[a, b] > R' defined by the relation 


b 
fre fe A(t) dt (be Cla, oD 


is linear. 
(2) The reader is probably acquainted with linear functions on one Euclidean 
space into another. We shall use three properties of these functions which are 
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proved in most books on linear algebra. (See, for instance, L. Mirsky, Jntro- 
duction to Linear Algebra, chapters IV and V.) 
(i) The function f: R” > R” is linear if and only if it is of the form 


f(%1, ines Xm) = On, pha Yn)s 


where ‘i= anixXy+ Et QimXms 
nee Seen ee er er (3.21) 
Yn = Any Xi + eoe ws apt OT, 
V1 Qy1.--Aim xy 
or ; —— | cevcceeee : 
Yn Any---Anm Xm 


The nx m matrix (a;;) is said to represent f. 

(ii) If the linear functions f: R” > R” and g: R" > R? have matrices A, B, 
then the linear function go f: R”™ > R? is represented by the matrix BA. 

(iii) Let f: R™ > R® be a linear function with matrix A. Then / is injective if 
and only ifm => mand A has (maximum) rank m. Also fis surjective (i.e. the range 
of fis the whole of R”) if and only if 2 < mand A has (maximum) rank n. Thus, 
a necessary and sufficient condition for f to be bijective (so that f—! exists and is 
again a linear function) is that m = n and A has maximum rank, i.e. det A + 0. 
When / is bijective, f-! has matrix A-}. 


It is easy to see that a linear function on R™ into R” is continuous 
at all points of its domain (see exercise 3(b), 8). A consequence of 
the theorem below is that any linear function either is everywhere 
continuous or is nowhere continuous. 


Theorem 3.24. Let V, W be normed vector spaces. If the function 
J: V > W is linear, then the following three statements are equivalent. 
(i) fis continuous on V. 
(ii) There is a point Xx) € V at which f is continuous. 
(iii) || f(x)||/||x\] is bounded for x « V— {6}. 


Proof. (ii) => (i). Take any e > 0. Since f is continuous at Xp, 
there is a d > O such that ||x—x,|| < 6 implies || f(x)—f(%»)|| < «. 

Let c be any point of V and take any x such that ||x—cl| < 6. Then 
|(x-—Cc+X9)—X|| < 6 and therefore || f(x—c+x9)—f(%p)|| < ¢, ice. 
I £() —f(o)|| < ¢. Thus fis continuous at c. 

(i) > (iii). Suppose that || f(x)||/||x|| is not bounded. Then, given 
any integer 7, there is a point x,, € V— {6} such that || f(x,)||/||x,|| > 7. 
If v, = (n||x,||)-1x,, then |v,|| = 1/n and 

| _ |fn)| 
eal = Gap > 1 
Hence fis not continuous at 0. 
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(iii) > (ii). Let N = sup ||f(x)||/||x/|. Since /(@) = 9, the inequality 
IFO < Nl 


shows that fis continuous at @. | 


Given the normed vector spaces V, W, denote by L(V, W) the set 
of continuous linear functions on V into W. With the usual interpre- 
tation of addition and scalar multiplication of vector valued functions, 
L(V, W) is a vector space. Moreover it is easily verified that the 


equation fl =_ sup Weis] eL) =——G.22) 


defines a norm on L(V, W). 
Notes. (i) In (3.22) the symbol || .|| is used in three different senses. 


Gi) LI = sup [fODl- 
Se re = a 
Theorem 3.25. If (a;;) is the matrix of the function f ¢ L(R™,R"), then 


m n $ 
max |a,| < If <(% 3 ah) . (3.23) 
Us] 2 
Proof. Let x be any point of R™ and let y = f(x). Applying 
Cauchy’s inequality (2.13) to (3.21) we have 
sta tyh < (Zt) (Eat) +-+(S ot) (Sa 
i=1 1=1 4 


41=1 


= 


ie Int? < (3 Sa) bo 


This proves the right-hand inequality in (3.23). 
Next, if |a,,| = max |a;;|, let x* be the vector with pth coneiositet | 
a, J 


and all other components 0. Then ||x*|| = 1 and 
IFO*)|| = (Gin t+..+Gnp)t > |Ap|- | 


Exercises 3(5) 
1. Prove the second part of theorem 3.23 without using the first part. 
2. Let CX, p), (Y, 7) be metric spaces. Show that f: X > Y is continuous at the 
point x, if and only if, given any open set Nin Y which contains f(x), there is 
an open set M in X such that x, € M and f(M) € N. 


3. Construct a function fon a metric space CX, p) which is discontinuous at all 
points of a dense subset E of X, but is such that the restriction of f to E is con- 
tinuous on E. 
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4. Let X, Y be similar, infinite sets and suppose that their metrics p, ¢ are such 
that CX, ) has no limit points while (Y, o) has at least one limit point. Show that, 
if the function f: X > Y is bijective, then f is continuous on X, but f~! is not 
continuous on Y. (Thus CX, p), (Y, 7) are not homeomorphic.) Give an example 
of such a function f. 


5. The metric spaces (X, p), (X, 7) have the same underlying set X. Let 
I: (X, p) > (X%, 2) 


be the identity function given by J(x) = x. Show that J is a homeomorphism 
if and only if p and o are equivalent metrics. Give an example to show that, when 
Pp, © are not equivalent, J may be continuous without J-1 being continuous. 


6. Let f be the function with domain X = [0, 1) U[2, 3] and range Y = [0, 2] 
defined on p. 37. Find a set G open in X and a set F closed in X such that f(G) 
is not open in Y and f(F) is not closed in Y. 


7. The function g with domain (— 0, «) and range [0, 00) is given by 
g(x) = (x—- 1)? e*. 


Show that there are open (closed) sets in (— 0, 00) which are mapped by g into 
sets that are not open (closed) in [0, 0). 


8. Let (X, ~) be a metric space and let ff, ..., ff, be functions on X to R'. The 
function f = (f,, ..., fn): X > R” is given by 


f(x) = (Ai@), ..-. fa). 


Prove that fis continuous at the point x, € Xif and only if, A, ..., f, are continuous 
at Xo. 

Hence, or otherwise, show that every linear function on R” into R” is every- 
where continuous. 


9. Let CX, p), (Y, 7), (W, 7) be metric spaces. To define continuity of a function 
on Xx Y into W we use one of the equivalent metrics on Xx Y of exercise 
2(a), 11. 

Given f: Xx Y> W and x,€ X, yo € Y, define the functions g on X and 
h on W by 


E(x) = f(x,y) EX), hh) =f%,.y) We Y). 


Show that, if f is continuous at (%o, y,), then g, A are continuous at Xp, Yo re- 
spectively. Construct an example to show that g, A may be continuous at Xo, Vo 
respectively without f being continuous at (Xo, Yo). 


10. Let D be the vector space of real differentiable functions on the interval [0, 1] 
and define the norm on D by 


|¢| = — {Pe (ge D). 


Let f: D > R' be the linear function defined by 
f(¢)=$¢O eD). 


Show that fis not continuous. 
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11. The linear function f: R" > R? is given by 


its 4 AG. Fd, 
Find || f]. 


12. Let U, V, W be normed vector spaces and let f: U > V, g: V-> W be con- 
tinuous linear functions. Show that 


lgofll < Igilfl. 


Deduce that, if fhas a continuous inverse f-!, then 


IFN F] > 1. 


Give an example of a linear function f such that 


[FIFA] > 1. 


3.3. Connected metric spaces 


In the classical account of continuity the more striking results, 
such as those relating to bounds and intermediate values are proved 
for continuous functions defined on finite closed intervals. These 
results do not hold for arbitrary continuous functions, but they can 
be regained by suitably restricting the domain. The theme of the 
remainder of this chapter is the relation between the structure of a 
metric space and the properties of the continuous functions defined 
on it. 

We remarked in chapter 2 that there exist metric spaces (X, p) in 
which sets other than @ and X are both open and closed. A space 
in which there are no such sets is called connected. Since a set is 
both open and closed if and only if its complement is both closed 
and open, the condition for a space not to be connected is that it is 
the union of two non-empty, disjoint open sets. The notion of con- 
nectedness can also be applied to subsets of a space. The subset E of 
X is said to be connected if the metric space (E, 7) is connected, 
where o is the metric induced by (X, p). In view of theorem 2.35 we 
may therefore formulate the definition of connectedness in the 
following way. 


Definition. Let (X, p) be a metric space. 

(i) The open subset G of X (which may be X itself) is said to be 
connected if there do not exist two non-empty open sets G,, G, such 
that G.:NG,= 2 and G,UG,=G. 

(ii) An arbitrary subset E of X is said to be connected if there do not 
exist open sets G1, G,, such that 


G,N£ + Dy Gol + B, GillG,; ak =a, G,UG, > E. 
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It is clear that part (i) is a particular case of part (11). Note that, 
by definition, the empty set is connected. 

The appropriateness of the word ‘connected’ is shown by the 
next two theorems. 


Theorem 3.31. A necessary and sufficient condition for a non-empty 
set in R' to be connected is that it is an interval. 


Proof. 

Necessity. Let E be a non-empty set in R! which is not an interval. 
Then there are three real numbers a, g, b such that a < q < b and 
ace E,be Ewhileg ¢ E. The sets G, = (—, q) and G, = (g, ©) are 
open and intersect E since they contain a, b respectively. Also clearly 


GiNG,NE= @ and Gi G, = E. 


Therefore E is not connected. 
Sufficiency. Let I be an interval and let G,, G, be open sets which 
intersect J, but are such that 


G;NiGaih= o . 


Take a point ae G,nJ and a point be G,n J. Since a + b, we may 
suppose that a < b. Then [a, b] is a subinterval of J. Let u be the 
upper bound of the set G, n [a, b]. Since a, b are interior points of G, 
and G, respectively and G, N G, N [a, b] is empty, it follows that 


=u <d. 
If now 6 is any positive number such that 
a<u—60 <u+6é < B, 


(u—6, u] contains at least one point of G, and (u, u+<¢) contains no 
points of G,. Hence (u—6,u+0) cannot be a subset of either G, 
or G,. Since G,, G, are open, it follows that u¢G, and u¢ G,, so 
that G, UG, > I. Therefore J is connected. | 


Theorem 3.32. A necessary and sufficient condition for a (non-empty) 
open set in R™ (or Z) to be connected is that any two of its points may 
be joined by a polygon lying entirely in the set. The polygon may be 
chosen so that its sides are parallel to the coordinate axes. 


Proof. ; 
Necessity. Let G be open and connected. Take a point c in G and 
let G, be the set of points in G which can be joined to,c by a polygon 


3.3] CONNECTED METRIC SPACES 45 


in G. Let G, = G—G, so that G,NG, = @ and G,UG, = G. We 
shall see that G, and G, are also open. 

If x is any point of G,, there is a B(x; e) < G. All points in B(x; €) 
can be joined to x by a line segment in B(x; €) and so in G. Since x 
can be joined to c by a polygon in G, so therefore can all points in 
B(x; 6). Hence B(x; ¢) < G,. Thus G, is open. The same kind of 
argument shows that G, is open. 

Since G is connected and is the union of the two disjoint open sets 
G,, Gz, at least one of these sets is empty. But ce G, and so G, is 
empty. Therefore G = Gj, i.e. all points of G can be joined to c by 
means of a polygon in G. Finally, any two points of G can be joined 
by way of c. | 

The polygon could be one with sides parallel to the axes, for a 
point in an open ball of R” may be joined to the centre by n segments 
parallel to the axes. 

Sufficiency. Suppose that G is open and that any two points of G 
may be joined by a polygon lying entirely in G. Let G,, G, be non- 
empty, disjoint open sets contained in G. We have to show that 
G, UG, + °G. 

Take a point c, € G, and a point c, € G,. There is a polygon in G - 
which joins c, and c,. This polygon must have a segment with end - 
points p, g such that p €¢ G, and ge G,. The segment may be repre- . 
sented by the vector equation 


x= x(t)=pt+(q—-p)t O<t< 1). 


Let u be the upper bound of the numbers ¢ such that x(f) € G,. 
Arguing in much the same way as in the proof of theorem 3.31 we 
see thatO < u < 1 and x(u) does not belong to G, or to G,. Therefore 
G, UG, + @ 


Corollary. The spaces R” and Z are connected. 


An open, connected subset of a metric space is called a region. 
Regions in Z will prove to be the natural domains of complex func- 
tions (chapter 10). 


Definition. Let (X, p) be a metric space and let E <— X. A set C is 
called a component of E if it is a maximal connected subset of E (i.e. 
if C is a connected subset of E and if no other connected subset of E 
contains C). 

It is an immediate consequence of the lemma below that any two 
distinct components of a set are disjoint. 
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Lemma. Let € be a collection of connected sets E in a metric space. 
If () Eis not empty, then U E is connected. 
EHeé Ee€é 


Proof. Suppose that H = U E is not connected. Then there are 
Heé 
open sets G,, G, such that 
GVH +} ©, Gna + ae ie. =, G_UGs— 


Let ae () E. Since ae H, we may assume that ae G,. If E is now 
Heé 


any set of 6&—,ae¢ Eand E < H so that 
G,NE + 6, G,AGNE = 6, .G,UG, > E. 


As £ is connected, we must also have G,n E = @. It follows that 
G, H = @, which is a contradiction. | 


Theorem 3.33. Any set in a metric space has a unique decomposition 
into components. 


Proof. Let E be a set in a metric space. For any x € E, denote by 
C,, the union of all the connected subsets of £ which contain x. Since 
{x} is connected (see exercise 3(c), 2), C, is not empty and, by the 
lemma, C, is connected. C, is also a component. For if C is a con- 
nected subset of E which contains C,, then C contains x and so, by 
definition of C,, C, > C. Finally it is clear that any component of E 
is a set C,. | 


Corollary 1. An open set in R” or Z is the union of countably many 
disjoint regions. 

Proof. Let G be an open set in R” or Z and let C be a component 
of G. If xe C, then C, = C. Also there is a 0 such that B(x; 6) < G 
and, since B(x; 0) is connected (theorem 3.32); B(x; d) < CG, = C. 
Thus C is open, i.e. C is a region. It now follows from exercise 
2(c), 5 that the set of components of G is countable. | 


Corollary 2. An open set in Ris the union of countably many open 
intervals. 


Theorem 3.34. If the domain of a continuous function is connected, 
then so is the range. 


Proof. Let (X, p), (Y, 7) be metric spaces and suppose that the 
function f: X > Y is.continuous. If f/(X) is not connected, then Y 
contains open subsets G,, G, which intersect f(Y) and are such that 


GiNG,nf(xX) = @ and G,UG, > f(X). (3.31) 
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By theorem 3.23, f-'(G,) and f-(G,) are open. These sets are also 
non-empty, since G, nN f(X) and G,nf(X) are non-empty. Moreover, 
by(3.30), 


ff 4a) = eS and (1G) S1G,).= Xx 


and so X is not connected. Hence if X is connected, then so is f(X). | 


Corollary. If the domain of a real-valued, continuous function is 
connected, then the range is an interval. 


Exercise. Prove that, if the real function fis continuous on the interval [a, 5], 
then fassumes in this interval every value between f(a) and /(d). (The intermediate 
value theorem of classical analysis, theorem 3.6 in Cl.) 


Exercises 3(c) 


1. Suppose that E < Xf Y and that the metrics p, 7 on X, Y respectively are 
identical on E. Show that E is a connected set in (X, p) if and only if it is a 
connected set in ( Y, 0). 


2. Show that, in any metric space, a set consisting of a single point is connected; 
and that a set which consists of more than one point and contains an isolated 
point is not connected. 


3. Prove that a metric space (X, p) is connected if and only if its only subsets 
with empty frontier are @ and X. 


4. Show that in the space of irrational numbers the only connected sets are those 
consisting of single points. 


5. Prove that, if a subset E of a metric space is connected, then E is also con- 
nected. If & is connected, is E necessarily connected ? 


6. Show that the components of a closed subset of a metric space are closed. Are 
the components of an open subset necessarily open? 


7. Show that the subset 
{(x, |x = 0, -1 < y < I} U{@, y)|0 < x < 1, y = sin (1/x)} 


of R? is connected. 


3.4. Complete metric spaces 


Readers are familiar with the idea of a convergent sequence. We 
now introduce a closely related concept. 


Definition. The sequence (x,) of points in the metric space (X, p) 
is said to be a Cauchy sequence if, given e > O, there is an ng such that 


P(Xms Xn) < € Whenever m,n > Np. 
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Theorem 3.41. If (x,) is a convergent sequence in a metric space 
(X, p), then it is a Cauchy sequence. 


Proof. Let lim x, = x. Given € > 0, there is an nm, such that 


no 


P(X», xX) < € for n > ny. If now m,n > Nn, 
DM Xias Xn) <P Gas 8) Py A) = 284 


The converse of theorem 3.41 is not true in every metric space. 
For instance if X is the interval (0, 1) of R14, then the sequence (1/n) 


is a Cauchy sequence, for, when e > 0, 
io 1 
———-|<¢é if mMn> -. 
m n € 


However (1/n) does not converge in (0, 1), since there is no point 
x in (0, 1) such that 1/n—x +0 as n-— oo. Another subspace of R? 
in which the converse of theorem 3.41 fails is the metric space of 
rational numbers. (See exercise 3(d), 2.) 


Definition. Let (X, p) be a metric space. A subset E of X (which may 
be X itself) is said to be complete if every Cauchy sequence in E has a 
limit in E. If X is complete, we say that the metric space (X, p) is 
complete. 


Notes. (i) In any metric space the empty set is complete (since it contains no 
Cauchy sequences). 

(ii) The non-empty set E in the metric space (X, p) is complete if and only if 
the metric subspace (E, 7) of (X, p) is complete. 

We shall establish the vital result that R! is complete. First we prove 
a lemma which has independent interest. 


Lemma. Every sequence of real numbers has a monotonic subsequence. 


Proof. Let (x,) be any sequence of real numbers. We call x, a 
terrace point if xX, < X, for alln > p. 

If there are infinitely many terrace points, let x,, be the first, x,, the 
second, and so on. The sequence (x,,) is clearly decreasing. 

If there are finitely many terrace points (perhaps none), take an 
integer v, so that no x, form > 1,, is a terrace point. There is now 
a v, > v, such that x,, > x,,. Again there is a vs > v, such that 
x,, > X,,3 and continuing in this way we obtain an increasing sub- 
sequence (x,,). | 


We do not need the full strength of the lemma, but only the special 
case that a bounded sequence has a convergent subsequence. 
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Theorem 3.42. The space R} (with its usual metric) is complete. 

Proof. Let (x,) be a Cauchy sequence in R*. We first show that 
(x,,) is bounded. Since there is an integer p such that 

\X»—Xm| < 1 whenever m,n > p, 
it follows that |x,| < |x,—x,|+|xp| < |x,|+1 
whenever 1 > p. Thus, for all n, 
[Xn] < max (|x|, [x2], ---5 [Xp—al> [Xp] + 1). 

It now follows from the lemma that (x,,) has a bounded, mono- 
tonic and therefore convergent subsequence, say (x,,). Suppose that 
as. as ho, 

We shall show that x, > x as n> oO. 


Take any positive e. Since (x,) is a Cauchy sequence, there is an 
integer 7%) such that 


lXm—Xn| < € for m,n 2 No. 
Also since x,, > x, there is an integer k, such that 
Ix,—-x| <¢ for k 2 ky 
As v,, > k, we then have, for n > max (Mp, Ko), 
|Xn—x| < |Xn—%,,| +|%,—%] < 2¢. 
Thus x, >xasn—>oo. | 


Corollary. The spaces R™ and Z (with their usual metrics) are com- 
plete. 


Proof. We need only consider R”. Let (x,) be a Cauchy sequence 
in RY 4 fe it). ten 
a =i Exel 
ena eee 6? 6.6 < P(Xps X;) 
ae es Enel 


and so each of the sequences (£,,),>15 «++» (Snr)r>1 18 a Cauchy sequence 
in R!. Therefore, by the theorem, there are real numbers §), ..., &, 


such that a Sie 
asr—> oo. If x = (&,..., &,), we then have 
P(X x) < |Ean— Er] +..- + [Snr — Sn] 


and, since the right-hand side tends to 0 as r > 00, it follows that 
X,>xasr—>o. | 
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Theorem 3.41 and the fact that R! and Z are complete may be 
combined in the following statement. A necessary and sufficient 
condition for the sequence (x,,) of real or complex numbers to converge 
is that, given ¢ > 0, there is an integer ny such that |x,—X,| < € 
whenever m, n > ny. In classical analysis this is referred to as the 
General Principle of Convergence. 

For subsets of metric spaces there is an intimate relation between 
the properties of being complete and of being closed. 


Theorem 3.43. Let (X, p) be a metric space and let E be a subset of X. 
(i) If E is complete, then E is closed. 
(ii) If X is complete and if E is closed, then E is complete. 


Proof. (i) Let (x,,) be a sequence of points in E which converges 
to a point x € X. By theorem 3.41, (x,,) is a Cauchy sequence in X and 
so in E. Since E is complete, it follows that (x,,) converges in £, i.e. 
that x e EF. Theorem 2.31 now shows that E is closed. 

(ii) Let (x,) be a Cauchy sequence in £. Since (x,) is a Cauchy 
sequence in X and X is complete, there is an x € X such that x,, > x. 
By theorem 2.31, xe E. | 


Several important metric spaces have functions as their elements. 
We shall now consider two such spaces which we have previously 
met in specialized form. 

Let (Y,7) be a metric space. The subset FE of Y is said to be 
bounded if there is an open ball that contains E; and, if X is an 
arbitrary set, the function ¢: XY > Y is called bounded if the subset 
¢(X) of Y is bounded. We denote by BCX, Y) the set of bounded 
functions on X into Y and we define the usual metric p on BCX, Y) 


YG, W) = sup 716), VO) WE BLY, YY). 


When X is also equipped with a metric, so that continuity on X has 
a meaning, we denote by C(X, Y) the subspace of BCX, Y) consisting 
of the bounded, continuous functions on X into Y. The usual metric 
on C(X, Y) is the one induced by (B(X, Y), p). When Y = R}, 
B(X, Y) and C(X, Y) are usually written BLY) and C(X) respectively. 
We have also previously used, and we retain, the notation Bla, 5] 
and C[a, b] for B(X) and C(X) when X is the interval [a, 5]. 


Theorem 3.44. If X is any set and (Y, 7) is a complete metric space, 
then B(X, Y) is complete. 
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Proof. Let (¢,) be a Cauchy sequence in BCX, Y) and let x be any 
point of X. Since 


- TPmnl(X)s Pn{X)) < oe (Pin(S)s Pn(E)) = PCPs Prs 


it follows that (¢,,(x)) is a Cauchy sequence in the complete metric 
space ( Y, 7). Therefore lim ¢,,(x) exists for every x e X. Leto: X > Y 


be the function given by 


O(x) = lim Pa(x) (% € X). 


The next step is to show that ¢ € B(X, Y). There is an integer p 


such that P(Pn» Pn) < 1 Whenever m,n 2 p. 


Therefore, for any x € X, 


T(Pin(X), Pn(X)) < 1 whenever m,n > p. (3.41) 
Taking n = p and letting m— oo in (3.41) we get (See exercise 
eee ($2), Gp()) < 1. 


Since this holds for all x € X and since ¢, is bounded (i.e. ¢,(X) is 
contained in an open ball), it follows that ¢ is bounded. 3 

We must still prove that ¢,, > ¢, ie. that p(¢,,, ¢) > 0 (which is a 
stronger assertion than the statement that ¢,(x) > ¢(x) for every x 
—see exercise 3(d), 7). Given e > 0, there is an m,) such that 


P(ms Pn) < € Whenever m,n > MN 


and so, by an argument similar to the one we have just used, we see 
that, for every xe X, 


7($(x), $,(x)) < € Whenever 1 > Np. 
Thus 


P(A, ,,) = sup 7(f(x), $»(x)) < € whenever n> ny 
and so ¢, >@. | 


Corollary. If X is any set, the space B(X) is complete. 


Theorem 3.45. If (X, 7) is any metric space and (Y,7) is a complete 
metric space, then C(X, Y) is complete. 


Proof. If (¢,,) is a Cauchy sequence in C(X, Y) and so also in 
B(X, Y), then, by theorem 3.44, there is a ¢ € B(X, Y) such that 
d, > asn-—> oo. We have to show that ¢ is continuous on X. 
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Let x) be an arbitrary point of X and take e > 0. There is an n 
such that 
PCS, bn) = sup T(9(x), bu(@)) < €: 


and, since ¢,, is continuous, there is a d > 0 such that 
7($,(Xo), Pa(x)) < € Whenever o(Xp, x) < 0. 
Hence, when o(X, x) < 34, 
T(P(Xo)s P(X) < (P(X), Pn(Xo)) + TPn(Xo)s Pnl(X)) + Tul) PX) 
< 3e, 


Thus ¢ is continuous at x, and, since x) was an arbitrary point of X, 
¢ is continuous on X. | 


Corollary 1. If (X, 7) is any metric space and (Y,7) is a complete 
metric space, then C(X, Y) is closed in B(X, Y). 
Corollary 2. If (X, 7) is any metric space, the space C(X) is complete. 

We end this section with a result that has many applications in 
analysis. | 
Definition. Let (X,p) be a metric space and let Q: X > X be a 
function mapping X into itself. If now there is a non-negative number 
k < 1 such that : ; 

P(Q(x), Q(x’) < kp(x, x’) 

for all x, x' € X, then Q is called a contraction mapping. 

Clearly a contraction mapping is continuous. 


Theorem 3.46. (Banach’s fixed point principle.) If Q is a contraction 
mapping on a complete metric space (X, p), then the equation Q(x) = x 
has one and only one solution (i.e. the mapping Q: X + X leaves one 
and only one point unchanged). 


Proof. Let xo be an arbitary point of XY and define the members of 
the sequence (x,,) by the recurrence relation 


Xn+1 = OX) (n 2 0). 
Then PXins Xmer) = P(2%m—-1)s 2% m)) 


S Kp(Xm-1s Ky) 


S k™ (Xo; xy) 
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and 
P(Xns Xn+r) < K(X» Xr) 
< k"{p(xo, X1) + 0(%1, X2) +... +POr-v Xf . : 
< k"p(xo, x) (l+k+... +k) | 


kn 
< [aK Po X}). | 


Since k” >0 as n> ©, (x,) Is therefore a Cauchy sequence and X, 
being complete, contains a point x such that x, > x. Now, since 2 
is continuous, by theorem 3.22, 


O(x;,) > Q(). 
But also = x 


and therefore Q(x) = x. 
To prove the uniqueness of the fixed point, suppose that 


ix) =x and. Of>') = x. 
Then p(x, x’) = p(Q(>x), Q(%')) < kplx, x’) 


and so, since k < 1, p(x, x) =O, i.ecx = x] 


Many theorems on the existence and uniqueness of the solutions 
of differential, integral and other equations are traditionally proved 
by various devices involving successive approximations. The essence 
of these procedures is embodied in Banach’s fixed point principle 
whose use therefore greatly simplifies the classical proofs. We illustrate 
the method by proving a theorem on implicitly defined functions. 


Theorem 3.47. Let (Xo, Yo) be an interior point of a set E in R*® and 
suppose that the function f: E > R? satisfies the following conditions. 
(i) I (Xo: Yo) se a 
(ii) f is continuous in an open set G containing (Xp, Yo); 
(iii) f,, exists in G and is continuous at (X, Yo), NA fy(Xo, Yo) + 9. 
Then there exist a rectangle 


MxN = [xp—%, X% +] x[vo—P; YotF] 


and a continuous function 6: M > N such that y = $(x) is the only 
solution lying in M x N of the equation f(x, y) = 9. 


Proof. Put 2 1 
t Hews 
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Since f,, is continuous at (Xp, Vo), there is a rectangle 
[Xo — 4, X9 +9] x [Wo—-F; Vo + A] 
contained in G in which 
l-g@a@ey < e | (3.42) 


Using the conditions (i) and (11) we can now find a positive number 
a < 6 such that, for xe M = [x)—@, x) +e], 


laf Yo)| < 4P. (3.43) 


By theorem 3.43, N = [y.—f,¥0+/f] is complete. Hence, by 
theorems 3.44 and 3.45, the spaces B(M, N), C(M, N), equipped with 
their usual metrics, are complete. Let be the mapping on B(V, N) 
defined by Q() = x, where 


xx) = W(x) -af YO) (xe M). 


Note that, if 7 is continuous, then y is also continuous. 7 
We first show that Q maps B(M, N) into itself. Let 7 ¢ BLM, N). 
Then, for x € M, we have, by the mean value theorem, 


QY~))—Yo = ¥X)—ah(% YX) —Vo 
= [¥x)—-af( YO) —-Do- af, Yo)l — 97: Yo) 
= [Y(x)— yl [1 —af,(x, w)]—9f(, Yo), 
where u lies between Yo and (x) and so in N. Hence, by (3.42) and 
ae QW (x) — yo] < HY) 9] +48 < B. 


We use a similar argument to prove that © is a contraction mapping. 
If 1, ’, € BLM, N) and xe M, 


Ob) (x) — 202) (&) = (Yi) -— af, ¥i))] - Yu) -— af, Y2%))] 
= [vi(x) a ¥(x)] [1 os gf, is v)I, 
where v lies between 7/,(x) and y,(x). Therefore, by (3.42), 
P(QY), L(%2)) < FeV, Wo). 


It now follows from theorem 3.46 that Q has a unique fixed point, 
which means that there is a unique function ¢: M+ WN such that 
Q(d) = d, i.e. f(x, d(x)) = 0 for xe M. (The uniqueness implies, 
in particular, that d(x.) = yo.) We still have to prove that the function 
¢ is continuous. But we have remarked that, if 7% is continuous, then 
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so is Q(). Hence, if Q* is the restriction of Q to C\M, N), then Q* 
is a contraction mapping on C(M, WN). Therefore Q* has a fixed 
point and, since a fixed point of Q* is also a fixed point of Q, it 
follows that ¢ is the fixed point of Q*. Thus ¢ belongs to C(M, N) 
and so is continuous. | 


A result similar to theorem 3.47 was proved in C1 (167) by entirely 
elementary means. But the present method of proof can be used for 
the general implicit function theorem involving systems of equations. 
(See theorem 7.43.) 


Exercises 3(d) 


1. Let (X, p) be a metric space and let E be a subset of X which has a limit 
point x». Suppose also that ( Y,o) is a complete metric space and that the 
function f : E > Y is such that, given e > 0, there exists 6 > 0 such that 


o(f(x), f(x’) < € whenever x, x’€ EN B(x; 4). 


Prove that lim f(x) exists. 
L—2XLy 


22Forn = lei. eet i 4 i 


Prove that (s,) is a Cauchy sequence in Q, the space of rational numbers. Show 
also that, for every integer g, q!s, is an integer and 0 < q!(e—s,) < 1 (where 


oe) 
e = >) 1/n!). Deduce that e is not rational and that Q is not complete. 
0 


3. Let D be the space of real functions differentiable on [0, 2] and let p be the 
metric on D induced by B[0, 2]. Show that (D, p) is not complete. 


4. The metric o on R' is given by 


Bg 


o(x,y) = ee 
a. 1+|x| 1+|»| 


| GreR?. 


Show that (R!, c) is not complete. 


5. The metrics p, 7 on a set X are equivalent. Are (X, p) and CX, c) necessarily 
both complete or both not complete? 


6. (i) Prove that the intersection of any collection of complete subsets of a 
metric space is complete. 

(ii) Prove that the union of a finite number of complete subsets of a metric 
space is complete. 


7. If (X, p) is a metric space, we define the diameter p(E) of a bounded subset 
Eof Xb 
P(E) = sup p(%, x2). 
XY, 2,€H 
Prove the following result. 
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Cantor’s intersection theorem. If (_X, p) is a complete metric space and (F,,) is 
a contracting sequence of non-empty, closed sets in X such that p(F,,) > 0 as 


io 6) 
n> oo, then () F, consists of exactly one point. 
n=1 
Show that, if any one of the conditions 


(i) (X, p) is complete, (ii) F, is closed, (il) p(F,) > 0 


is omitted, then () F, may be empty. 

n=1 
8. Give an example of a sequence (¢,,) and a function ¢, all in B[O, 1], such that 
d,(x) > $(x) for every x € [0, 1], but ¢, +> ¢. 


9. Let V, W be normed vector spaces. Prove that, if Wis complete, then L(V, W) 
is complete. 

(Note that the metrics on L(V, W) and on C(V, W) are defined quite differently. 
The only element common to the two spaces is the zero function ©: V> W 
such that O(x) = 6 for all x € V.) 


10. The real function fis continuous on the interval [a, b] and differentiable in 
(a, b); Also a =f) = b for a < x = b and |f @)| = k < 1 fora <x < 3B. 
Prove that equation f(x) = x has one and only one solution in [a, 5]. 


a 


11. Prove the following implicit function theorem. 

Let S be the strip [a, b]x(—, 00) of R* and suppose that the function 
ff: S > R' satisfies the following conditions: 

(i) fis continuous; 

(ii) f, exists in S and there are constants m, M such that 


O<msfi(xy) <M 
for all (x, y) ES. 
Then there exists a continuous function ¢ on [a, 6] such that, for a < x < 5, 
y = (x) is the only solution of the equation f(x, y) = 0. 


12. Prove the following theorem. 

Let S be the rectangle [x,»—a, x» +a]x[yvo—5, yo + 6] of R?®. The function 
ff: S ~ R' is continuous and satisfies the following Lipschitz condition: There is a 
constant A such that 


| F &, yy) —f(%, ¥2)| < Al ¥1—Ye| for all (x, ys), (x, 2) ES. 
Also let | f(x, y)| < Bin S; and let « in [0, a] be such that 
a < min (1/A, 5/B). 
Then the differential equation : 
y 


i IGS); 


subject to the initial condition y = y) when x = Xo, has a unique solution in the 
interval [x»9—@, X¥)»+ a]. 

(First verify that y = (x) is a solution of the differential equation, subject 
to the given initial condition, if and only if ¢ satisfies the integral equation 


aoe I ” F(t, $(t)) dt.) 
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13. Let 2: R! > R' be defined by 
O(x) = x+1/(1+e?). 
Show that (i) Q has no fixed point, and (ii) for all distinct x,, x, € R’, 
|) — QA(%2)| < [xy — Xe]. 


14. Given Q: X > X, define Q® = Q and QO” = 100°) (n > 2). Show that, 
if the metric space (X, p) is complete and there is a positive integer r such that 
Q® is a contraction mapping, then 2 has a unique fixed point. 


15. The function 2 : C[0, 1] ~ C[O, 1] is defined by 
)(x) = : “g(t)dt (bE ClO, 1], x€ [0, 1). 


Prove that Q'is not a contraction mapping, but that Q® is a contraction mapping. 


3.5. Completion of metric spaces 


In this section the notion of isometry, defined on p. 39 is essential. 
We also recall (see exercise 2(c), 11) that, if (Y, 7) is a metric space, 
the subset D of Yis dense in Yif D = Y. 


Definition. Let (X, p) be a metric space. The complete metric space 
(Y, c) is said to be a completion of (X, p) if it contains a metric sub- 
space (X, 7) such that 

(i) (X, p), (Xo, %) are isometric, and 

(ii) X, is dense in Y. 


Theorem 3.5. Every metric space has a completion and all the com- 
pletions of the space are isometric. 


Proof. Let (X, p) be a metric space. We begin by constructing a 
metric space (X*, p*) which will be seen to be a completion of (X, p). 

(i) Definition of X*. We call two Cauchy sequences (x,,), (x,) in 
(X, p) equivalent and write (x,) ~ (x;) if p(x,, Xn) +0 as n> oo. It 
is easily checked that we have, in fact, defined an equivalence relation. 
We denote by X* the set whose elements are the equivalence classes 
generated by this relation. 

(ii) Definition of p*. Let x*, y* be two elements of X* and let 
(x,)> (¥,) be arbitrary members of x*, y* respectively. Then (x,), (Vn) 
are Cauchy sequences in (X, p) and 


| (Xm Vn) —P(Xns Yn) S Pms-Xn) + PW Vn)- 


Therefore (p(x, ¥,)) is a Cauchy sequence in R? and, since R?* is 
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complete, lim p(x,, y,) exists. Moreover, if (x,,), (y,) are any other 
members of x* and y*, i.e. (x,) ~ (x,) and (y,) ~ (7), then 
[POns Yn) — Pn» Vn)| < PXns Xn) + PV ns Vn) 
andso Tn Ons Vn) = = P\%ns Yn) 
We are therefore justified in defining the function p* on X¥* x X* by 
P(x", YT) = Ei pln» Vn), 


but we must still verify that p* is a metric on X*. 

Clearly p*(x*, y*) > 0 and p*(x*, x*) = 0. Now suppose that 
prt, y*).= 0. If-&) ea", &,) <2", then-lim p(x, 33 
(x,) ~ (y,), and x* = y*. Thus M1 is satisfied. It is clear that M2 
holds. To prove M3 we take (x,,) € x*, (y,) € y*, (Z,) €z* and let 
n —> oo in the inequality 


Pas Zn) SPE Pa POs Zale 


(11) (X*, p*) contains a subspace isometric with (X, p). The subset 
X, of X* is defined as follows: x* € X, if and only if there is an 
x € X such that the (Cauchy) sequence (x, x, x,...) belongs to x*. 
This x* therefore consists of all the sequences in X which converge 
to x. Evidently different points x, y in X determine distinct members 
of X>. Thus the function f: X > X, defined by 


I(x) = x*, 


where x* is the equivalence class containing (x, x, ...), is bijective. 
If x, ye X and f(x) = x*, (Oy) = y*, so that (, x. ee 
(y, y, ..-.) € y*, then, by definition of p*, 


p*(x*, y*) = p(x, y). 


Hence f is an isomeiry. 

(iv) Xq is dense in X*. Take an arbitrary element x* of X*. Let 
(x,,) be a member of x* and, for each k, denote by x,,* the element of 
X, which contains the sequence (X;, x;, ...). Then 


pro, Xx") ox ee Xp) 


and, since (x,) is a Cauchy sequence, the right-hand side tends to 0 
as k > oo. Thus x,* > x*. This means that XY, = X*, i.e. that X, 
is dense in X*. 
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(v) (X*, p*) is complete. Let (x*,,) be an arbitrary Cauchy sequence 
in X*. Since X, is dense in X*, for each n there is a y,* € Xp such 
P*(X¥ ns Vn*) < Ifn. 

The equivalence class y,,* contains a sequence (V,; Yn, --.) of points 
in XY. Now 


PVs Yn) = P* Om; Yn*) 
OO i RO Le) 


are 
< PEO? is a or 


Since (x*,,) is a Cauchy sequence in X*, it follows that (y,) is a 
Cauchy sequence in X. Let y* < X* be the equivalence class contain- 
ing (y,,). Then 


p*(y*, x*,) <= p*0*, yz") + p*0.*, x*) < lim P(Vns Vx) + 1/k. 


As (y,) is a Cauchy sequence, the right-hand side tends to 0 as 
k + oo, i.e. x*, > y*. Hence (X*, p*) is complete. 

(vi) Any two completions of (X, p) are isometric. Let (Y, a) be a 
completion of (X, p) and let ( Yo, 7) be a subspace of (Y, o) isometric 
with (X, p). Proceeding as in (i) we group the Cauchy sequences 
(y,,) in Yo into equivalence classes y*. 

Since Y is complete, every Cauchy sequence in Y, has a limit in Y 
and clearly all Cauchy sequences belonging to the same equivalence 
class y* have the same limit y. Conversely, to any ye Y, there 
corresponds an equivalence class y*. For, since Y, is dense in Y, 
there is a sequence (y,) in Y,) which converges to y and y* is the 
equivalence class determined by (y,). We have therefore set up a 
bijection between the equivalence classes y* and the points y of Y. 

Let (W, 7) be another completion of (X, ») with subspace (W4, 75) 
isometric to (X, p). As before, we have a bijection between the points 
w of W and the equivalence classes w* of Cauchy sequences in W,. 

The spaces (Yo, 7) and (W,, 79) are isometric, since each is iso- 
metric with (X, p). If (y,,), (w,) are corresponding sequences in Y, 
and W, respectively, then | 


TVs Vn) = T Wing Wn) 


and therefore, when one is a Cauchy sequence, so is the other. 
Moreover, if (y,,), (w,,) also correspond, then (y,,) ~ (y,) if and only 
if (w,) ~ (w,). Hence the isometry between (Yo, 7) and (W4, 75) 
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induces a bijection between the set of y* and the set of w* and thence 
leads to a bijection between Y and W (which preserves the original 
bijection between Y, and W). 

Now let y, y’€ Y and w, w’ € W be any corresponding pairs of 
points. Let (y,), (vj) and (w,,), (Wx) be corresponding sequences from 
Y, and W, respectively such that y,>y, ¥n>y and w, >w, 
w, >w’. In view of the isometry between (Y>, %) and (W, T)) we 
have oy, y') = lim ons 3%) = lim 7(W7q, wa) = 710, W’) 
Thus the bijection between (Y, o) and (W, 7) is, in fact, an isometry. | 


It is now easy to construct a completion of (X, p) which actually contains 
CX, p) as a subspace. Let yt = Xu (X*-X). 


We obtain a natural correspondence between the points x* of X* and the points 
x* of X* by using the bijection between X and X, and by letting the points of 
X*— X, in X+ and in X* correspond to themselves. The metric p* on A* is then 


alco prox, v4) = ptt, y*). 


(X+, pt) is the desired completion of (X, p). We have proved that any metric 
space can be embedded in a complete metric space which is the closure of that space. 


Exercises 3(e) 


1. Let (X, p) be a metric space. Show that, if (Y, 7) is a complete metric space 
containing a subspace isometric with CX, p), then (Y, 7) contains a completion 
of (X, p). 

2. Show, by means of an example, that a metric space (X, p) may have com- 
pletions (Y, 0) and (¥;, 04) such that (Y;, 7) is a proper subspace of (Y, 7) 
(i.€. Y- 7; += @). 

3. Given the metric space (X, p), let (X*+, p*) be a completion which contains 


(X, p) as a subspace. Prove that a contraction mapping 2 on (X, p) has a unique 
continuous extension Q+: X+ > X+, and that ©* is a contraction mapping on 


(X*, p*). 


3.6. Compact metric spaces 


Among the properties of metric spaces considered by us the one 
with the most far-reaching consequences is compactness. 


Definition. Let (X, p) be a metric space. A subset E of X (which may 
be Xitself)is said to be compact if every sequence in E has a subsequence 
which converges in E. If X is compact, we say that the metric space 
(X, p) is compact. 
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Notes. (i) In any metric space the empty set is compact. (ii) The non-empty 
set E in the metric space (_X, p) is compact if and only if the subspace (£;0) of 
(_X, p) is compact. 


Illustrations 


(i) In any metric space, a set of only a finite number of points is compact. 

To prove this we need only remark that the empty set is compact and that, if 
(x,) is a sequence in the set {&), ..., &,}, at least one point €; appears infinitely 
often in (x,). 

(ii) Every finite closed interval of R” is compact. 

First consider an interval [a, b] of R*. If (x,) is a sequence in [a, b], then (x,) is 
bounded and, by the lemma to theorem 3.42, contains a subsequence (x,,) 
which converges to a point x € R. But, for all k, a < x,, < 5 and so 


a < lim x,, < b. 
k-> © 


Thus “Sis € [a, b] and so [a, b] is compact. 

Next, let J = [a, b] x[c, d] be a finite closed interval in R° and let (x,) be a 
sequence in J. If xn = (€:, Jn), then, since (€,) lies in [a, b], (€,) has a sub- 
sequence (£,,.) which converges to a point & in [a, db]. Put § = a 1 = Ue 
As 77; lies in [c, d], there is a subsequence (),) which converges to a point 7 
in [c,d]. Also &, > &, since & > &. But (&,) and (7,,) are subsequences (£,,) 
and (7,,) of (é,,) and (7,,) respectively. Therefore the subsequence (x,,) converges 


to the point x = (&, 7) in J. Hence J is compact. 

The argument leading from R! to R? may be adapted to make the inductive 
step from R”?! to R”. 

(iii) R" and Z are not compact. For instance in R* the sequence of positive 
integers has no convergent subsequence. 

(iv) C(X) (with its usual metric) is not compact. For, whatever metric X has, 


the functions ¢, (n = 1, 2, ...) given by 
P(x) =n (xe X) 
are continuous on X and the sequence (¢,,) clearly has no convergent subsequence. 
The last two illustrations show that acomplete metric space need not 
be compact. On the other hand, compactness implies completeness. 


Theorem 3.61. A compact set in a metric space is also complete. 


Proof. Let (X, p) be a metric space and let E be a compact set in X. 
A Cauchy sequence (x,) in E has a subsequence (x,,) which con- 
verges to a point x € E. Then 

PG x) < Xn Kad + P(Xyqs x) = 0, 
i.e. x, > x. (Essentially the same argument, in an expanded version, 
forms the last part of the proof of theorem 3.42.) | 


The property of compactness may also be expressed slightly 
differently. 


62 CONTINUOUS FUNCTIONS ON METRIC SPACES [3.6 


Theorem 3.62. A set E ina metric space is compact if and only if every 
infinite subset of E has at least one limit point in E. 


Proof. 

(i) Suppose that the set E in a given metric space is compact. Let 
A be an infinite subset of Z. Then A contains a countably infinite 
subset the points of which may be arranged in a sequence (x,). 
Since E is compact, (x,) has a subsequence (x,,) which converges to 
a point xe E. Clearly x is a limit point of the set {x,, x,,, ...} and 
so of the set A. 

(ii) Suppose that every infinite subset of EF has at least one limit 
point in E£. Let (x,,) be any sequence in E. The set of points {x,, x2, ...} 
may then be finite or infinite. If {x,, x.,...} is finite, then (as in 
illustration (i) on p. 61) at least one of its points occurs infinitely 
many times in the sequence (x,). Thus (x,) has a convergent sub- 
sequence. If the set {x,, x,, ...} is infinite, then it has a limit point 
x in £ and this is clearly the limit of a subsequence of (x,). | 


V9? 


A bounded set in R” lies in a bounded closed interval. Since 
such an interval is compact, we now have the following result: A 
bounded, infinite set in R” has at least one limit point. This is the 
Bolzano—Weierstrass theorem, one of the earliest results in point set 
theory. 

The next result is an analogue of theorem 3.43. 


Theorem 3.63. Let (X, p) be a metric space and let E be a subset of X. 
(i) If E is compact, then E is bounded and closed. 
(ii) If X is compact and if E is closed, then E is compact. 


Proof. 
(i) If E is compact then it is also complete and so closed by 
theorem 3.43 (1). The direct proof is also quite simple. 
Let a be a fixed point of X. If Fis not bounded, there is a sequence 
(x,) 1n E such that 
Ax > 0 tne tee, ak 


Thus p(x,, a) > co as n-—>oo. If (x,,) is any subsequence of (x,), 
P(X,,, 4) > 00 as k + co and so (x,,) cannot converge. Hence E£ is 
not compact. 

(ii) Let (x,,) be any sequence in E and so in X. Since Y is compact, 
there is a subsequence (x,,) with limit x in X; and, since E is closed, 
theorem 2.31 shows that x e E. Therefore E is compact. | 
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Corollary. In R” a set is compact if and only if it is bounded and closed. 
Although in the space R” the converse of (i) is true, in general it is 
false. A counter example is given in exercise 3(/), 3. 
In the rest of this section we investigate the properties of continuous 
functions with compact domains. 


Theorem 3.64. If the domain of a continuous function is compact, then 
the range is also compact. 


Proof. Let (X, p) be a compact metric space, let (Y,o) be any 
metric space and suppose that the function f: X¥ > Y is continuous 
on X. We wish to show that the subset /(Y) of Y is compact. 

Take any sequence (y,,) in f(X). For each n choose an x, such that 
(Xn) = Yn» (There may be several x for which f(x) = y,.) Since X 
is compact, the sequence (x,,) contains a subsequence (x,,) which 
converges to a point x € X. Then, by theorem 3.22, the continuity of 
f ensures that /(x,,) >/(x). Hence, if f(x) = y, y,, > y, where 
ye f(x). | 
Corollary 1. If the domain of a continuous function is compact, then 
the range is bounded and closed. 


Corollary 2. If the domain of a real-valued continuous function is 
compact, then the function is bounded and attains its upper and lower 
bounds. 

The second corollary follows from the fact that a bounded, closed 
set in R! contains its supremum and infimum (exercise 2(c), 3). It 
generalizes the corresponding elementary result in which the domain 
of the function is a finite, closed interval (C1, theorem 3.71). 


Theorem 3.65. If the domain of a bijective, continuous function is 
compact, then the inverse function is also continuous (i.e. the function 
is a homeomorphism). 


First proof. Let f: X > Y be a bijective, continuous function with 
compact domain (X, p) and with range (Y, o). 

Let (y,,) be a sequence in Y which converges to a point ye Y. 
We wish to show that f—1(y,,) >f—(y). Put f“(y,,) = Xn. Since X is 
compact, (x,) contains at least one convergent subsequence. Let 
(x,,) be such a sequence with limit x, say. Then, as fis continuous, 


Vy, = f%y) > £0). 
But y,, > y and so f(x) = y, i.e. x = f(y). Therefore every con- 
vergent subsequence of (x,) converges to x = f~*(y). It then follows 
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by the compactness of X (see exercise 3(/), 4) that x, > x, or 
F'n) > FO): 


Second proof. This is shorter, but more sophisticated. 

Let F be a closed set in X. By theorem 3.63 (ii), F is compact and, 
by theorem 3.64, f(F) is compact. Then, by theorem 3.63 (i), /(F) is 
closed in Y. Thus (f—!)7! (F) (.e. f(F)) is closed in Y whenever F is 
closed in XY. Therefore, by theorem 3.23 (ii), f-1 is continuous on Y. | 


An immediate corollary is the well known theorem (C1, theorem 
3.9) that a strictly monotonic, continuous, real function on an interval 
has a continuous inverse. ¢ 

The property of uniform continuity, possessed by real functions 
continuous on a closed interval, was briefly discussed in Cl (§3.8). 
It was used to prove that a continuous function is integrable. We 
now consider uniform continuity in the general setting of metric 
spaces. 

We recall that, when (XY, /) and (Y,o) are metric spaces, the 
function f: X¥ > Y is continuous at the point ce X if, given e, there 
is a 0 such that 


o( f(x), f(c)) < € whenever p(x, c) < 0. 
Here the number ¢ depends on c (as well as on e) and generally one 


cannot expect one ¢ to suit all c’s. When this is the case, i.e. when, 
given ¢, there is a 0 = d(€), independent of c, such that, for all c in X, 


o( f(x), f(c)) < € whenever p(x, c) < 40, 


then f is said to be uniformly continuous on X. Actually a slightly 
different, but equivalent form of the definition is more useful. 


Definition. When (X, p) and (Y,@) are metric spaces, the function 
ff: X > Y is said to be uniformly continuous on X if, given e, there is 
a 0 such that 


o(f(X1), f(%2)) < € whenever p(x,, X2) < 6. 
The condition is clearly equivalent to the set 
E(6) = {o(f(%1), £(%2))|P1 X2) < 9} 


being bounded and its supremum w(6) tending to 0 as 6 + 0. 

Uniform continuity of course implies ordinary continuity. That 
continuity does not generadly imply uniform continuity is easily 
shown by examples. 
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(i) The real function f on (0, 1) given by f(x) = 1/x is continuous, 
but not uniformly continuous. For . 
oe 


m1) << Xy, XH <1 1X, — xa < 6 
Ce =| 9- “62 1X, if 


E(6) = | 
=[i-j]o<x<a}=[lo<x<3l 


and so E(6) is not bounded. 
(ii) The real function g on R! given by g(x) = cos (x?) is also 
continuous, but not uniformly continuous since 


w(6) = sup |cos (x?) —cos (x)| = 2. 


lai < 


(For if x, = [(n+1)z]2, x, = (nm), then x;— x < n-.) 
With the help of the next theorem it is also easy to exhibit uni- 
formly continuous functions. 


Theorem 3.66. A continuous function with a compact domain is 
uniformly continuous. 


Proof. Let f: X > Y be a continuous function whose domain 
(X, p) is compact and whose range lies in an arbitrary metric space 
Yo). 

Suppose that fis not uniformly continuous. Then there exists an 
¢ > O with the property that, to every 6 > 0 there correspond points 
x, y € X such that 


p(x,y)< 8 and o(f(x), f(y) 2 ¢. 
Thus there are sequences (x,), (y,) of points in X such that 
P(X%ns Yn) < Ifn and o(f (Xn), Sn) 2 & 


Since XY is compact, the sequence (x,,) contains a convergent sub- 
sequence (x,,) with limit «, say. We have 


PV» ©) S POV ry Xy) +P ys &) < 1/¥ + P(X» &) > 0 
as k + oo. Hence Ri et andy, 08; 
and, since fis continuous, 
F(X) fe) and f(Vy,) > f(@). 
Thus o(f(%,,),/(,,)) > 9. (3.61) 
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However ((x,,, ¥,,)) 18 a subsequence of the sequence ((x,, y,,)) and 
therefore OU (XI Wn) 2 for every K. (3.62) 


As (3.61) and (3.62) are incompatible, fis uniformly continuous. | 


A simple example (for instance a constant function on a non-compact set) 
shows that the condition of the last theorem is not necessary for uniform con- 
tinuity. 

Exercises 3(f) 


1. The metrics p, c on a set X are equivalent. Prove that (_X, p) is compact if and 
only if (X, o) is compact. 


2. Prove, without using theorem 3.43, that a compact subset of a metric space is 
closed. 


3. Let B[O, 1] have its usual metric and let ¢ be the function defined by ¢(x) = 0 
for 0 < x < 1. Show that the bounded and closed set B(¢; 1) is not compact. 


4. Let (x,) be a sequence in a compact metric space. Show that, if every conver- 
gent subsequence (there is at least one) converges to the point x, then x, > x. 


5. (i) Prove that the intersection of any collection of compact subsets of a 
metric space is compact. (Show first that the intersection of a compact set and a 
closed set is compact.) 

(ii) Prove that the union of a finite number of compact subsets of a metric 
space is compact. 


6. Show that, in a compact metric space, every contracting sequence of non- 
empty, closed sets has a non-empty intersection (cf. exercise 3(d), 7). 


7. The function /: [a,oc) > R! is continuous, and f(x) > / as x > 0. Prove that f 
is uniformly continuous on [a, 0). 


8. The real functions f, g, h, k on [0, 0) are defined by f(x) = x’, g(x) = sin x, 
h(x) = (x+1)-! cos (x2), k(x) = x?. Which are uniformly continuous on [0, 00)? 


9. A real function on an interval [a, b] is called piecewise linear if there are points 
xi =. 0, 1,....,8#) such that a = x, < % <...< %. <%= b andie pe ee 
is linear on each interval [x;_,, x;] i = 1, ..., 7). 

Show that, given any function fe C[a, b] and any e > 0, there is a piecewise 
linear function g € C[a, b] such that p(f, g) < ¢ (where p is the usual metric on 
C[a, 5)). 


Note. In classical analysis and algebra a real function / is called /inear if it 
is of the form f(x) = ax + b (xé€ R}), i.e. if it has a straight line graph. The 
nomenclature is carried over to functions on R” to R! and to complex functions. 

This usage clashes with the definition of a linear function on p. 39 unless 
b=0. It is, however, long-established and expressive and we occasionally employ 
it. The context will make clear the interpretation intended. (The vector space 
terminology for the function f above is affine.) 


3.6] COMPACT METRIC SPACES 67 
10. The functions f, g: C[0, 1] ~ C[0, 1] are given by 


FONGy = I 4d aOG) = [eo ($€C[0, 1], xe[0, 1). 


Show that both functions are continuous on C[0, 1]. Are they uniformly con- 
tinuous? 


11. Let E be a compact set in a metric space (X, p). Prove that there are points 
x, y € E such that 
P(E) = p(x, y). 


(For the definition of the diameter p(E£) of E see exercise 3(d), 7.) Show also that, 
if E is assumed to be merely bounded and closed, then such points x, y need 
not exist. 


12. In a metric space CX, p), the distance between the point c and the subset 
E of X, denoted by p(c, E), is defined by 


p(c, E) = inf p(c, x). 
xeH 


Prove that the function f: X > R' given by 

f(x) = p(x, E) 
is uniformly continuous on X. Prove also that p(x, E) = 0 if and only if xe E. 
13. In a metric space CX, p), the distance between the subsets A, B of X, denoted 


by p(A, B), is defined by 
P(A, B) = = fo y). 


TEA, YE 


Prove that, if A, B are compact, there are points a € A, b € B such that 


P(A, B) = pG, 4). 
Prove also that in R” (with its usual metric) p(A, B) = p(a, b) for some 
aé A, be B if one of the sets A, B is compact and the other is closed. 


14. Let CX, p) be a metric space, let A, B be disjoint subsets of X and suppose 
that one of A, B is compact, the other closed. Prove that 

(i) p(A, B) > 0, 

(ii) there need not exist points a € A, b € B such that p(A, B) = p(a, d). 


15. Give an example of a metric space (X,/) with disjoint, closed subsets 
A, B such that p(A, B) = 0. 


3.7. The Heine—Borel theorem 


In theorem 3.62 we considered a property which was evidently 
closely related to that of compactness and we proved that the two 
are, in fact, equivalent. In this section we shall establish a character- 
ization of an entirely different kind. 


Definition. A collection L of sets A is said to be a covering of a set 
E if Ee ti A 


AEA 
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The covering is finite if S has a finite number of members only. When 
E and the sets A of & all lie in a metric space, the covering is said to be 
open if every set A of & is open. 

If & and &, are both coverings of E and %, <— &, we say that A, 

is, relatively to %, a subcovering of E. : 

Mlustrations : | 
(i) In Z let E = BO; 1) and let Ay = B(C; 1). Then E is covered by the sets 

Ape Cees 
(ii) The interval (0, 1) is covered by the open intervals 
| (fs) 16 se = 
-Gii) The set C[0, 1] is covered by the sets 
A, = {¢|¢ € BIO, 1]; n—1 < 60) < n+1} (= 0, +1, +2,...). 
The covering (i) clearly contains , finite silicide. The covering in (ii) 


contains only infinite subcoverings, while that in (iii) has no proper subcoverings. 


We shall show that a necessary and sufficient condition for a set 
in a metric space to be compact is that every open covering of the 
set should contain a finite subcovering. The sufficiency is easily 
verified; the proof of the necessity requires some preparation. 


Lemma-1, Let E be a compact set in a metric space (X, p) and let ¢€ 
be a given positive number. Then there is a finite number of points 
Ci... 0, nh Se at 


Ec U Be. 
n=1 


Proof. Take c, to be any point of E. If E, = E-— B(c,; 6) is not 
empty, let c, be any point of E,. Generally, if it has been possible to 
Choose Cyia.<> Cg and If 

ke 
E, = E—U Ben; €) 
n=1 
is not empty, c,,, is taken to be some point of E;. We wish to show 
that, after a finite number of steps, we arrive at an empty £,. 

Suppose that no E;, is empty, so that we obtain an infinite sequence 

(c,). When n > m, c, ¢ BC} €), 1.€. 


Pl Ons Ga) Foe 


Hence (c,,) cannot have a convergent subsequence (for such a sub- 
sequence would have to be a Cauchy sequence). This contradicts the 
hypothesis that Z is compact. | 
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Lemma 2. Let E be a compact set in a metric space (X, p) and let G 
be an open covering of E. Then there is a positive number « such that, 
for every x € E, the = ball B(x; am) is contained in some member 

G of F. | 


Proof. Suppose that the lemma is false. Then, for every n, there 
is an x, € E such that B(x, 1/n) is not contained in any G of &Y. 
Since E is compact, (x,,) has a subsequence (x,,) which converges to a 
point x* e E. Let G* be a member of Y which contains x*. Since G* 
is open, there isa d > Osuchthat _ 


—B(x*: 26) < G*. 

For all sufficiently large values of k, p(x,,, x*) < 6. Hence there is 
an integer r such that 
| | (0. x < © and “r > te. 
Hf then x 3G. ti). : 

p(x, x*) < p(x, X,) + 0%, x*) < 20 
i.e. x € B(x*; 28). Therefore 

o.. tir) = BGx*: 26) < G", 


This is a contradiction, since the B(x,,; 1/n) were chosen to be open 
balls not lying in any member G of %. | 


Theorem 3.7. (Heine—Borel.) A set in a metric space is compact if and 
only if every open covering of the set contains a finite subcovering. 


Proof. 

(i) Let E be a compact set in a metric space and let ¥ be an open 
covering of E. By lemma 2, there is an « > 0 such that, for every 
x € E, B(x; «) is contained in some G of Y. Having chosen «, we can, 
by lemma 1, find points c, ..., c, such that 


p 
Foe 243 Ble a). 
n=1 
Now, for each n such that 1 < n < p, there is a member G,, of F 


such that B(c,; «) < G,. Then 
Ec UG, 


n=1 


and so {G,, ..., G,,} is a finite subcovering. 
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(ii) Let E be a non-compact set in a metric space. Then, by 
theorem 3.62, E contains an infinite subset D which has no limit 
points in E. Therefore to each point c € D there corresponds an open 
ball G, = B(c;r,) such that G,n D = {c}. Moreover the open set 
G = (D)’ is such that GU D > E. Therefore 

G = {G} U{G,|c € D} 
is an open covering of E. But the only member of ¥ which contains 
a given point ce D(< E) is G,. Hence every subcovering must con- 
tain the infinite collection {G,|c ¢ D} and so Y contains no finite 
subcovering. | 
Exercises 3(g) 


1. A set E in a metric space is called totally bounded if it has the property which 
was proved for compact sets in lemma 1, namely that, given e > 0, there are 


points c,, ..., cp such that p 
E<= U Ben; €). 
n=1 


Prove that a totally bounded set is bounded and show, by means of an example, 
that a bounded set need not be totally bounded. 


2. Let E be a totally bounded set in a metric space. Show that, given a sequence 
(x,) in E, there are sequences (X17), (%o,n), (X3,n), ... Such that (x1, n) = (x,) and 
lor & >: 2, 

(i) (Xzn) is a subsequence of (Xz_1,n); 

(ii) (Xz, n) lies in an open ball of radius 1/k. 
Prove that (X,,,) is a Cauchy subsequence of (x,). 

Deduce that a set in a metric space is compact if and only if it is totally 
bounded and complete. 


3. Prove theorem 3.66 by means of the Heine—Borel theorem. 


4. The function f: [a, b] + R! is said to be increasing at the point x € [a, 5] if 
there is a 0(x) > 0 such that 

= flu) < f@) < fe) 
when ue (x: (x), x] N [a,b], vel[x, x+d(x)) N [a, 5). 


Show that, if f increases at every point of [a, b], then / increases in [a, d] (i.e. 
f@ <f(P)whena<a<f< b). 


NOTES ON CHAPTER 3 


§3.4. Although the fixed point principle has only recently become well known, 
S. Banach (1892-1945) proved it as long ago as 1920. It formed part of his 
doctoral thesis, which was published under the title ‘Sur les opérations dans les 
ensembles abstraits et leur application aux équations intégrales’ in Fund. 
Mathematicae 3 (1922), 133-81. | 


§ 3.6. What is now called the Bolzano—Weierstrass theorem was discovered by 
B. Bolzano (1781-1848), a profound Czech mathematician whose writings were 
largely ignored during his life-time. The theorem reached a wide public only 
when it figured in Weierstrass’s lectures at the University of Berlin. 


NOTES Fi 


§3.7. E. Borel (1871-1956) proved in 1894 that a covering of a finite closed 
(linear) interval by a countable collection of open intervals has a finite sub- 
covering. The restriction of countability was removed by H. Lebesgue (1875- 
1941) who made extensive use of the improved result in his theory of integration 
(1902). By 1905, theorem 3.7 (for bounded closed sets in R”) was essentially 
known. (See Borel’s note in Comptes Rendus 140 (1905, I), 298-300.) 

In 1871, H. E. Heine (1821-81) established the uniform continuity of a real 
function continuous on a finite closed interval. Since his argument contains 
the seeds of a covering theorem, the appellation ‘Heine—Borel’ has become 
attached to Borel’s original theorem and its extensions; but ‘Borel—Lebesgue’, 
which is sometimes used for the post 1902 versions, is probably more appropriate. 


Topological spaces. Once Fréchet had led the way to abstraction with his 
introduction of metric spaces, it was not long before mathematicians began to 
look for further generalizations. The neat definition below, which was arrived 
at only after a good deal of experimentation, rests on the observation that many 
properties of metric spaces and of continuous functions on metric spaces can be 
expressed by means of open sets without explicit mention of the metric. The 
definition of connectedness is already in this form; the Heine-Borel theorem 
gives an alternative definition of compactness in terms of open coverings; while in 
exercise 3(b), 2 and theorem 3.23 (i) open sets are used to formulate the property 
of continuity. 

Given a non-empty set X,a collection 7 of subsets of X is said to be a topology 
on X if it has the following three properties: 

(i) @ and X belong to 7, 

(ii) the union of any collection of sets in 7 belongs to 7, 

(iii) the intersection of a finite number of sets in 7 belongs to 7. 

If 7 is a topology on X, then the pair (X, 7) is called a topological space and 
the members of 7 are called the open sets of (X, 7). Since the collection of 
open sets of a metric space has the properties (i)-(iii), a metric space is a topolo- 
gical space. Equivalent metrics on a set generate the same topological space. 

A topological space is called metrizable if there exists a metric on the under- 
lying set such that the collection of open sets produced by it coincides with the 
given topology. There are simple examples to show that not every topological 
space is metrizable. For instance, if X is an infinite set, let its topology consist of 
% and all subsets G of X such that G’ is finite. If p is any metric on X, let a, b be 
arbitrary distinct points of X and put p(a, b) = r(>0). The open balls B(a; tr), 
B(b; 4r) are both open in (X,p), but, since they are disjoint, they cannot both 
have finite complements. B(a; 4r) and B(b; 4r) therefore cannot both be members 
of the given topology. Thus topological spaces are genuine generalizations of 
metric spaces. 

Although many features of metric spaces survive the transition to topological 
spaces, there are, inevitably, some casualties. The most notable of these is the 
property of completeness; for the notion of a Cauchy sequence has no interpre- 
tation in topological spaces. (The theory of uniform spaces has been created to 
accommodate a suitable generalization.) | 


Real numbers. In the notes at the end of chapter 1 we referred to the method of 
cuts which may be used for constructing the system of real numbers from that of 
rational numbers. It leads to Dedekind’s fundamental theorem (p. 16) from 
which, in turn, follow results such as those on the existence of suprema and infima 
of bounded sets of real numbers, the convergence of bounded monotonic sequences 
(C1, theorems 1.8 and 2.6), and the completeness of R* (theorem 3.42). 
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We now sketch a construction of the real numbers which is based on the notion 
of a Cauchy sequence. For a detailed exposition see H. A. Thurston, The Number 
System. 

Let Q be the set of rational numbers. Elementary analysis within Q employs 
the customary definitions with rational e’s. Thus the sequence (x,) of rational 
numbers is said to converge to the rational number x, and we write x, > x, if, 
given any rational ¢ > 0, there is an integer m, such that 


|x, —x| < € whenever 1 > 1M. 


Again, a Cauchy sequence in Q is a rational sequence (x,,) such that, given any 
rational e > O, there is an ny, such that 


|Xm—Xn| <€ whenever m,n > No. 


We define the equivalence of two Cauchy sequences (x,), (x,,) in Q as at the 
beginning of the proof of theorem 3.5: (x,) ~ (x)) if x,-x,>0 as n>. 
The equivalence classes generated in this way are then called real numbers. We 
denote real numbers by starred letters. If x is a rational number, the real number 
x* which contains the sequence (x, x,...) is called the corresponding real rational 
number. Note, however, that in general the use of the symbol x* does not carry 
with it the implication that x* is necessarily a real rational number. 

The next step is to define the algebraic operations with real numbers. If x*, y* 
are two real numbers and (x,), (x/,.) € x*, (vn), 0%) € y*, then (Xn + yn) ~ (4,47) 
and (xnyn) ~ (x, y,). We can therefore define x*+y* and x*y* to be the real 
numbers containing the Cauchy sequences (x,+ y,) and (x,y,) respectively. Also 
—x* is, unambiguously, defined as the real number containing (—¥x,). The 
definition of x*-!, when x* + 0*, is not quite so simple to justify. It needs the 
following lemma. 


(1) Tf (Xn) € x* + 0*, then there is a rational number k > 0 such that |x,| > k 
for all sufficiently large n. 

If (1) were false, there would exist a subsequence (x,,) tending to 0 and this, 
in turn, would imply that the Cauchy sequence (x,,) tends.to 0. Thus (1) holds. It 
follows that, if (x,), (x) @x* + O* and x, + 0, x, + 0 for all n, then (x7) 
and (x’~1) are equivalent Cauchy sequences. We can therefore define x*~* to be 
the real number containing (x; 1). It is now easy to prove that the real numbers 
form a field whose zero element is 0* and whose unit element is 1*. (See axioms 
A1-A6 on p. 15.) 

A slightly stronger form of (1) is that, if (x,) € x* + O*, then either (i) there is 
a rational k > O such that x, > & for all sufficiently large n, or (ii) there is a 
rational k > 0 such that x, < —k for all sufficiently large n. We then say 


x* > D*.-ore x* <0" 


according as (i) or (ii) holds (for the same statement holds for all Cauchy 
sequences in x*). Clearly x* > O* if and only if —x* < 0*. We define 


x* => p* ong ex) 


to mean that x*—y* > 0*. With this definition of > it may be shown that the 
field of real numbers is, in fact, ordered. (See axioms O1—O3 on p. 16.) 
Finally we define the modulus of a real number by 


I a ee 
|x*| = 


xtc te 2 


f 
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It is a simple matter to verify that the bijection between the rational numbers 
and the corresponding real rational numbers preserves algebraic relations and 
order, i.e. if x, y are rational numbers, then 


Q). (tyy* = x¥+y*, (xy)* = x*y*, (—x)* = —x*%) (&)* = x* 
and 

(3) x* = -y* according a5 x= y. 

Moreover, by (2) and (3), ; 

(4) jx[* = [x*| 

and so also 

(5) |x*| = |y*| according as |x| = ly]. 


At this stage we could develop the elementary analysis of real numbers on 
the basis of the familiar definitions of convergent sequences and Cauchy sequences 
(which now use real e*’s). Taking this development for granted we next show 
that the set of real rational numbers is dense in the set of real numbers, i.e. that, 
given any real number x*, there is a sequence (x,,*) of real rational numbers such 
that x,,* — x*. This is a corollary of the following assertion. 


(6) If x*, y* are real numbers and x* < y*, there is a real rational number u* 
such that x* <u* <.y*. 

To prove this let (x,) € x*, (y,) € y*. There is a rational number k > 0 such 
that y,—x, > k for n > v,, say. Also |x,,—x,| < +k for m,n > Ve, say. Let 
v = max ();, ¥2) and put u = x,+4k. Since 


Xnttk <u < y,z—} 


for n = v, the real rational number u* (which contains (u, u,...)) is such that 
ea Ay, 
The last theorem has three further important consequences. 


(7) If X, Xn (n = 1, 2, ...) are rational and x*, x,* (n = 1,2, ...) are the corres- 
ponding real rational numbers, then x,* > x* if and only if x, > x. 

Suppose that Xn + x. Given any real number e* > 0*, by (6) there is a real 
rational number 7* such that 0* < 7* < e*, By (3), 7 = 0 and so |x,—x| <7 
for n > v, say. This implies, by (5), that |x,*—x*| < 4 * < e* for n > v. Thus 
x,* > x*. The opposite implication follows immediately from (5). 


(8) A real rational sequence (x,*) is a Cauchy sequence if and only if the corres- 
ponding rational sequence (x) is a Cauchy sequence. 
The proof is similar to that of (7). 


(9) If x* is any real eee if (Xn) € x* and x,* is the real rational number corres- 
ponding to x,, then x,* > x*, 


Take any real number e* > 0*, By (6), there are real rational numbers 

n*, k* > O such that 0* < 7*+k* < e*. Since (x,) is a Cauchy Sequpnce, there 
ae v such that ~H—K < X_,—X_ < n+k 
when m,n 2 v. Let m > v. Then since (7, 7, ...)€9* and (Xn—X1, Xm—Xo5 -»-) 
E Xm" — x, the definition of inequality among real numbers implies that 

ee ee eee ee 

Hence x," > x*. 

We are now able to prove the fundamental theorem of the present theory. 
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(10) Every Cauchy sequence of real numbers converges. 

We adapt part (v) of the proof of theorem 3.5. Let (x*,,) be any Cauchy sequence 
of real numbers. By (6), for each v there is a real rati6nal number y,,*, corres- 
ponding to a rational number y,, such that 


xt Iat] < nt 
Then — [Yin® —Yn*| << |Ym* —X* | + |X*m— X* nl + [x*n—In*| 
< [x*,—x*,| + m* +n. 


Since (x*,,) is a Cauchy sequence, so therefore is (y,,*). By (8), (y,) is a Cauchy 
sequence of rational numbers which determines a real number y*. By (9), 
yn* > y*. Since 


\x*,—y*| < |x*n—Yn*| + |yn*—y*| < n*1+ |y,*—y*|, 


it follows that x*, > y*. 

We have shown that, with the usual metric, the set of real numbers is a com- 
pletion of the set of real rational numbers. Some of the arguments had counter- 
parts in the proof of theorem 3.5. There could be no question of applying the 
theorem, since real numbers are employed both in its enunciation and its proof. 

Rational numbers, having served their purpose, need no longer be used. We 
therefore abandon the * notation for real numbers. In particular, 0,+1, +2,... 
will now denote the real numbers which were formerly written 0*, +1*, +2%,... 

One more theorem is needed to set analysis on its customary course. 


(11) If the non-empty set E of real numbers is bounded above, then it has a least 
upper bound (supremum). 

Let U be the set of upper bounds of E. Take x, € U’, x, € U, so that x,\.< Xp. 
Either 2(%1 + X2) EU’ or 4(x,+x,) € U. In the former case put x, = 4(x,+ 2), 
X4 = x2; in the latter put x, = 1, %, = 4(*,+ x,). Continue this process. When 
X1, ...» Xen have been chosen, then 


Xonzy = 4(Xan-1 t+ Xen)y Xenee = Xan OF Xenyy = Xan-1, Xonva = FXan-1+ Xen) 
according as 4(Xs,_1+ Xen) belongs to U’ or to U. We have 

Xi S X3 S Xx Sw S Xe S XS Ko, 
with Xon_1 € U’, Xe, € U for n = 1,2, .... Also clearly 

|Xm—Xnl < (%q—x)/2” if m,n > 2v+1 


and so (x,,) is a Cauchy sequence. By (10), (x,) converges to a limit x. 
If ye E, y < Xe, for all m. Hence y < x for every ye E, i.e. xe U. But, if 
z < x, there is an z such that 
Z <-Xoni1 SX 


and so zé U’. Hence x is the least member of U, i.e. x is the supremum of E. 

Clearly Dedekind’s theorem is a consequence of (11). 

From this point the development of analysis follows the same lines whether 
cuts or Cauchy sequences are used to define real numbers. The two methods 
yield logically distinct objects (a cut of the rational numbers is not the same as 
an equivalence class of Cauchy sequences), but the effect is the same, since 
precisely. the same theorems hold for the two systems. 


4 
LIMITS IN THE SPACES R' AND Z 


4.1. The symbols O, 0, ~ 


In this chapter we shall develop the theory of sequences and series 
of real and complex numbers, building upon foundations such as are 
laid in Cl. We start by defining three symbols which contribute to 
brevity of statement. 

Suppose that (x,,) and (v,,) are sequences of real numbers. 

(1) When v,, > 0, we relate the magnitude of x, to that of v, by 
two definitions. 

(i) If there is a constant K such that 


ix) <= Ko, 


for all n, we write x, = O(v,). 
(11) If, asn > oo, 


a 
Vy 
we write X, = O(U,). 
(2) If, asn>o, 
Xn 
afte 
Vn 


we Writ€ Xn ~ Une 


Illustrations 


Xn, = O(1) means that x, is bounded. 
Xn, = o(1) means that x, >O0asn—>o. 
If x, = 5n?—7n+9, then 


y= 0). x ~50.. x, = 0%. 


Observe too that x, = O(n?) and that O does not rule out o or ~. 


The notation lends itself to an extended usage. We can write, for 
guess O(v,) + O(w,) = O(v,+W,). | (4.11) 


This means that, if a given sequence (x,,) satisfies x, = O(v,,) and a 
_ given sequence (y,,) satisfies y, = O(w,,), then x,+), = O(Un+ Wp). 
Note that, if v, = w,, the relation (4.11) becomes 


O(v,) + OW,) = Ol,). 
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We are also able to express concisely a number of other steps 
which could occur in the manipulation of sequences, such as 


O(n) + Op) = OWn)s 
O(n) OW) = OC Wn)s 


O(v,) O(Wn) = O(VnWn)s 
Y O(1) = O(n), 
Uy ~ Wy > U,+O0(W,) ~ Wp: 


To demonstrate how these symbols retain the gist of an argument and slough 
off what does not matter, we investigate the convergence of Xu,, where ~~ 


un, = 


and & is a constant. 

We have u, = O(1/n*-*), so, if k > 3, 4u, converges. 

An O relation, expressing as it does only an upper bound, can never establish 
divergence. However here we have 


and so Xu, diverges if k < 3. 


We have defined O, 0, ~ for functions of the integral variable 
nas n-> oo. The definitions are adaptable to functions of the con- 
tinuous variable x as x tends either to co, — o or to a finite limit. 


Illustrations 


As x > aw, x" = ofe*), sin x = O1). 
As x > 04, x7 = of2), o' "= of), sim & = Ole) or sin x ~ x. 
As x>0, sin (1/x) = O(1), 1—cos x ~ 4x’. 


Exercises 4(a) 


1. Find whether the relations x, = O(v,), Xn = O(Un), Xn ~ Un imply the corres- 
ponding relations between 


n n v1) n 
fa) J x and. 3 Ok es 
r=1 oat po | f=] 


2. Prove the following results. 
(i) If x, = OC /n*), where k > I, then 


S x, = O0/n-D 


r=n 


a | 
and S hp = ALOR, 


where A is a constant. 
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Gi) If x, = O(1/n), then 


x, = O(log n). 


iM: 


T= 


O(n*), where k > —1, then 


(iii) If x, 
n 
= Xp = O(n**}), 
r=1 


(Use the inequality 1/r* < | (1/x*)dx (k > 0).) 
a4 


r 


3. Prove that, if u, ~ v, and v, = O(1), then u,—v, = o(1). Show also that the 
hypothesis v, = O(1) cannot be omitted. 


4. Show that, if x, = O(1) and 2x, < X,_1+%n4, for all n, then (x,) converges. 


5. Establish the following relations. 
(i) As x0, log (1+x) = x+O(x’). 
(ii) As x > 0, (1+x)*% = 14+ax+O(x*). 
(iii) As x > 0, sin x = x—3x'+ O(|x|5). 


6. Prove that, for r > 4, 


1 (i 4r 
= < —~dx < —— 
PS Jena X 4r?--] 


and deduce that, as r >, the value of the integral is 1/r+O(1/r*). Hence 
show that on 


1 1 1 
—- = log 2———-+ O|-—}. 
pee? Be An+2" ( ) 
7. Prove that, as x > > pee s 
; = eta E nei xetn 2x 


(Use the fact that, if the function f on [1, 0) is positive and decreasing, then 
N N 

Y f(y)- i f(x) dx tends to a limit / such that 0 < / < f(1) (Cl, 141).) 
n=1 1 


4.2. Upper and lower limits 


So far in this book (and in C1) the symbol oo has only occurred in 
the designation of an interval (such as O < x < o) oras part of the 
phrase ‘x, — 0’. It is now convenient to extend the usage. For 
instance we shall use ‘lim x, = 00’ to mean the same thing as 
‘X, > 00°; or ‘sup x, = oo’ to mean that (x,,) is not bounded above. 
Other phrases involving oo occur in the two definitions of this 
section. However, as heretofore, we shall not attempt to perform - 
any algebraic operations with oo and — oo. 
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The sequence 1 
X, = sin gn + 
does not converge, but 


Xan > — 1, Xa, —> 0, Xar4+1 > I 


as r — oo. Hence 1 and —1 are the greatest and least numbers which 
are limits of subsequences of (x,). These will be called the upper and 
lower limits of (x,) (see theorem 4.21, corollary 2). For the formal 
definitions we prefer, however, a different approach. _ 

Given any sequence (x,,) of real numbers, consider successively the 


sequences 
< X15 Nes X35 eoeg Xn Xn+1s eerg 


(4.21) 


e. @- C3 6c" 6 -3 S88. OU 8. 8 Oe 


If (x,) is bounded above, these sequences have suprema 
Ba, Ks, oss Bee nes 


For all n, kK, > &K, x, sce 14, Xe Gas ss > 1X eee 
(in fact K, = max (x, K,4+,)). Hence K,, tends either to a finite limit 
or to —oo. Moreover K,, > — oo if and only if x, ~ —oo. For if 
Xn > — 00, then, given any number K, however large, there is an ny 
such that x, < —K forn > np. Hence, forn > m, K, = sup x, <—K 


r2>n 

and so K, > — oo. On the other hand, if K, ~ — oo, then x, > —©, 
sce 2 eA. 

If (x,) is not bounded above, none of the sequences (4.21) is 
bounded above, i.e. sup x, = © for all n. 

T2N 

Definition. Let (x,) be a sequence of real numbers. 

If (X,) is bounded above, 

A = lim (sup x,) 


NO T2>N 
is called the upper limit (or limit superior) of (x,,) and we write 


lim x, = A or limsupx, = A. 


Nn—->0oO no 
If (x,) is not bounded above, we write 


lim x, = 0 or lmsupx, = o. 
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If (x,) is bounded below and k,, k2,... are the infima of the 
sequences (4.21), then k, < ky, for all m and k, tends to a finite 
limit unless x, > 00. We are therefore led to the following definition. 


Definition. If (x, is bounded below, 
A = lim (inf x,) 


nO 72>N 
is called the lower limit (or limit inferior) of (x,,.) and we write 


limx, =A or liminfx, = A. 


no i a 


If (X,) is not bounded below, we write 


limx, = —o or liminfx, = —0©o. 
n—>oo n> 
We have lim inf x, < lim sup x,, (4.22) 
n—>o no 


since inf x, < sup x,; and 


T>Nn r>n 


lim inf (—~x,,) = —lim sup x,, (4.23) 


since inf (—x,) = —sup x,. Note that the upper and lower limits 


r>n r>n 
in (4.22) and (4.23) need not be finite. 
A convenient characterization of finite upper and lower limits 
is given in the next theorem. 


Theorem 4.21. 
(i) The number A is the upper limit of the sequence (x,) if and only 
if, given e > 0, 
(a) x, < A+e for all sufficiently large n; and 
(b) xX, > A—e for infinitely many n. 
(ii) The number A is the lower limit of the sequence (x,) if and only 
if, givene > QO, 
(c) x, > A—e for all sufficiently large n; and 
(d) x, < A+e for infinitely many n. 


Proof. 
(i) Necessity. Suppose that A = lim sup x, = lim K,, where 
K, = sup x,. Then, given e > 0, there is NV such that 


T]>N 
Ky < Ate 


and so Xx, < Ate for n2N. 
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This is (a). To prove (5) we need only show that, given any integer p, 
there is an” > p such that x, > A-—e. 
Since (K,,) decreases, K,, > A for all m. So, by the definition of 


K,,41, there is an nm > p+1 such that 


Dp 
hg eg Pe, 
This gives (db). 
Sufficiency. Suppose that (a) and (d) hold. By (a), K, < A-+e for 
n > N;by (0), K, > A—eforalln. Thus K,, > A,i.e.A = lim sup x,. 
(ii) This follows from (i) and (4.23). | 


Corollary 1. The sequence (x,,) tends to las n > 0 if and only if 


lim inf x, = lim -sup x, =: 
Proof. 
(i) / finite. 
If x, > /, then, given e > 0, there is N such that 


l-e< x, < l+e (4.24) 


forn > N. Hence / = lim sup x,, and / = lim inf x,. 

If lim sup x, = lim inf x, = /, then (a) and (c) show that (4.24) 
holds for all sufficiently large n. Therefore x, > /asn—- 0. 

(ii) / infinite. 

Xn, > oo if and only if liminf x, = o. Also liminfx, = 
implies lim sup x, = ©. 

x, > — 0 ifand only iflim sup x, = —o;andlim sup x, = —2 
implies lim inf x, = —o. | 


Corollary 2. A = lim sup x,, when finite, is the largest number which 
is the limit of a subsequence of (x,); A = lim inf x,, when finite, is the 
smallest such number. 


Proof. The conditions (a) and (5) show that, for every e > 0, 
there are infinitely many m such that A—e < x, < A+e. From this 
follows that there is a subsequence (x,,,) with limit A. Also (x,) 
cannot have a subsequence converging to a number A’ > A, for if 
e < A’—A, there is only a finite number of ” for which x, > A’—e. 

The statement about lower limits is proved similarly or by use of 
(4.23). | 


The analogue of corollary 2 for infinite upper limits is that 
lim sup x, = —0o if and only if x, ~ — 0; and lim sup x, = oo if 
and only if there is a subsequence of (x,,) which diverges to 00. The 
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first statement has already been proved; for the proof of the second 
see exercise 4(b), 2. There is a similar characterization of infinite 
lower limits. 


Illustrations 
i) X, = (—1)"; lim inf x, = —1, limsup x, = 1. 
(ii) x, = 2n/(n+1); lim inf x, = lim sup x, = 2. 
(iii) x, = n{1+(—1)"]; lim inf x, = 0, lim sup x, = ©. 
(iv) x, = 1—n; lim inf x, = lim sup x, = —©O. 


A proof of the completeness of R! was given in theorem 3.42, but 
a more natural proof uses upper and lower limits. 
Let (x,,) be a Cauchy sequence. Given e > 0, there is an N such 


ae \Xm—-Xn| <¢€ for m,n 2 N, 


Thus Xy-€ <X,<Xyte for n=>N 


andso xy—é < ky < limk, < lim K, < Ky < xyt+e. 


no no 
Hence lim inf x,, and lim sup x,, are finite and 
0 < lim sup x,,—lim inf x, < 2e. 
Since ¢ is arbitrary, lim sup x,, = lim inf x, and, by theorem 4.21 


corollary 1, (x,) converges. | 


We now turn to functions of a continuous variable for which we 
define upper and lower limits at oo (or — 00) and at a point. 


First suppose that fis defined on an interval (c, 00). If fis bounded 
above in some interval (X, 0), we define 
lim sup f(x) = lim (sup /(‘)); 
Zo azo t>2x 
if f is unbounded above in every interval (X, ©) we write 


lim sup f(x) = ©. 


The definition of lim inf f(x) is similar. 


TF 


Next let f be defined on (a—h, a) U (a, a+h), say. If f is bolinded 
above in some set (a—7, a) U(a, a+%), we define 


lim sup f(x) = im ( sup # (x)); 


—-0+ 0<|z-al< 


if f is unbounded above in every set (a—7, a) U(a, a+), we write 


lim sup f(x) = 
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The analogues of theorem 4.21 and corollary 1 are left to the 
reader. 


Illustrations 
(i) limsup sin x = 1, liminfsinx = —1. 
xL—>0 xa2—->oo 
(ii) lim sup sin (1/x) = 1, lim inf sin (1/x) = —1. 
xz—>0 xr>0 


(iii) lim sup eV” = ©, lim inf e?* = 0. 
xz—>0 


xz—0 


Exercises 4(5) 


1. Prove that x, = O(v,) if and only if lim inf (x,/v,) and lim sup (x,/v,) are 
finite. 


2. Prove that lim sup x, = 00 if and only if a subsequence of (x,,) diverges to 00; 
and that lim inf x, = —0o if and only if a subsequence of (x,) diverges to — 0. 


3. Let (x,), (v,) be sequences of real numbers such that x, < y, for all x. Prove 
that 


(i) lim inf x, < liminf y,, (ii) lim sup x, < lim sup yy. 
4. Show that, if (x,), (y,,) are bounded sequences, then 
lim inf x,+lim inf y, < lim inf (x,+y,) < lim inf x,+lim sup y, 
< lim sup (x,+y,) < lim sup x,+lim sup y,. 
Construct sequences (x,), (¥,) for which strict inequalities occur at every step. 


5. The sequences (x,), (vn) are non-negative and bounded. Establish a chain of 
inequalities as in 4 with products in place of sums and give an example in which 
strict inequalities occur at every step. 


6. (i) Let y, > y > 0. Prove that 
(a) if lim inf x, is finite, then 


lim inf (x,yn) = y lim inf x,; 


(b) if lim inf x, = co (—©o), then lim inf (x,y,) = © (—%); 
(c) if lim sup x, is finite, then 


lim sup (X,¥n) = y lim sup x3 


(d) if lim sup x, = © (—oo), then lim sup (%,y,) = 0 (—0). 
(ii) Show that (a), (6), (c), (d) need not hold if y, + 0. 
(ili) Let y, >y < 0. Prove that, if (x,) is bounded, then 


lim inf (x,y,) = y lim sup x,, lim sup (x,y,) = y lim inf x,. 


7. The function f: R! > R' is continuous and increasing. Prove that, if (x,) is 
any bounded sequence of real numbers, then 


f(im inf x,) = lim inf f(x,), /(im sup x,) = lim sup f(x,). 


4.2] | UPPER AND LOWER LIMITS 83 
8. Let Sn =< uy+ 5 a i — v,+ eee Pn, 


where (u,) is an arbitrary sequence of real numbers and (v,) is a sequence of 
positive terms such that ¢, > oo. Prove that 


—— oS s u 
lim inf — < lim inf rd < lim sup < lim sup —. 
v 


9. Let (x,) be a sequence of real numbers. Show that 


ened ee rate ee G : oP a oP 2 : 
(i) lim inf x, < lim inf Saat < lim sup ae < lim sup x,; 
and, if x, > 0 for all z, 

(ii) lim inf x, < lim inf (x, ... x,)"" < lim sup (%4...X,)"" < lim sup xp; 


Xn+1 Xn41 


< lim inf x1” < lim sup x1” < lim sup — 


nT nr 


(iii) lim inf —— 


10. Prove that, asn > ©, 
G@) ifk> —-l, 
1*+2%+ +n ~ nk +1) 
(cf. exercise 4(a), 3); 
Gi) = 7 4 
Gili) (n!)¥" ~ n/e. 


11. One-sided upper and lower limits. Suppose that the real function fis defined 
in the interval (a, a+h). If fis unbounded above in every interval (a, a+7), we 
write lim sup f(x) = ©; otherwise we define 


ee lim sup f(x) = lim ( sup J (x)). 


Lat 60+ a<x<at+é 


lim sup f(x) and the two one-sided lower limits are defined similarly. 
wa — 


Prove that, if fis defined on (a—h, a) U (a, a+h), 
lim sup f(x) = max {lim sup f(x), lim sup f(x)}, 
z2—a~> L—a+ 


wa 


lim inf f(x) = min {lim inf f(x), oe srs f(x}. 


ta r—-a— 


12. Semi-continuity. The function f: [a,b] > R! is said to be upper semi-con- 
tinuous at the point c € (a, b) if 


f(c) = lim sup f(x); 


xw—>C 


and fis said to be lower semi-continuous at c if 


f(c) < lim inf f(). 


Semi-continuity at a, b is defined by use of the appropriate one-sided upper and 
lower limits. 

Show that, if fis upper (lower) semi-continuous in [a, b], then fis bounded 
above (below) and attains its supremum (infimum). 


84 LIMITS IN THE SPACES R! AND Z [4.2 


13. For the function /: [a, b] > R' define 


Df(c) = lim inf KO HOI) , .Dflc) = lim sup i o — f(x) 


xL—>c —X 


when c € (a, b), and define Df(a), Df(a), Df(b), Df(b) as the corresponding one- 
sided upper and lower limits. 

Prove that, if Df(x) = 0 for a < x < b, then f increases in [a, bd]. (First take 
Df(x) > 0 in [a, b] and use exercise 3(g), 4.) Show also that the condition 
Df (x) > 0 in [a, 6] does not ensure that f increases in [a, 5]. 


4.3. Series of complex terms 


The familiar tests of Cauchy and d’Alembert for series of positive 
terms (C1, 88) may be strengthened by the use of upper and lower 
limits. It is also convenient to have these tests in a form in which 
they apply to series of complex terms, and ensure either absolute 
convergence (and so convergence) or divergence. 


Theorem 4.31. (The root test-Cauchy.) Let xu, be a series of 
complex terms. 

(i) Jf lim sup |u,|/" < 1, then Xu, converges absolutely. 

(ii) Zf lim sup |u,|"" > 1, then Xu, diverges. 


Proof. 
(i) Let lim sup |w,|/" = A and take c so that (0 <)A <c <1. 
By theorem 4.21, there exists N such that 


la, <0 eee | =o" 


foralln > N.SinceO < c < 1, Xc” converges and so, by comparison, 
x|u,,| converges, i.e. Lu, converges absolutely. 

(ii) Suppose that lim sup |u,,|/" > 1. If the upper limit is infinite, 
|u,,|"" is unbounded. Otherwise we again appeal to theorem 4.21. 
In either case we see that there are infinitely many n for which 
|u,|V/" > 1, or |u,| > 1. Therefore u, +> 0 and so Zu, diverges. | 


Theorem 4.32. (The ratio test-d’ Alembert.) 
(i) If lim sup |u,+1/u,| < 1, then Xu, converges absolutely. 
(ii) Zf lim inf |u,4:/u,| > 1, then Xu, diverges. 


Proof. 
(i) If lim sup lun salMn < c < 1, there is an N such that 


lt .a( tal Roe fer ae a WN, 
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anen, for 2 > N, 


| 


Fe eee eee < |uylov-N = Ac” 
un 


Un—-1 


and therefore &|u,,| converges. 
(ii) There is an N such that |u,,,/u,| > 1 for n 2 N. Hence 
lay| <-|anaal) = 2, and so y, ->-9. | 


Exercise. Give an example of a convergent series =u, with 
lim sup |un41/Un| > 1. 
By exercise 4(5), 9 (iii), 
lim inf |2,,.,/u,| < lim inf |u,|” 
< lim sup |u,|!” < lim sup |u,43/u,|. 


Therefore theorem 4.32 follows from and is weaker than theorem 
4.31. (See exercise 4(c), 2.) 


Exercises 4(c) 
1. Give examples to show that, when lim sup |w,|!" = 1, the series Xu, may 
converge or diverge. Show also that both convergent and divergent series may be 
such that lim sup |u@ri1/ué,| = 1 or lim inf |un41/u,| =1. 


2. Give an example of a series which theorem 4.31 shows to converge and to 
which theorem 4.32 is not applicable. Give a corresponding exar le ofa 
divergent series. 


3. The sequence (u,) of real numbers is such that lim sup u, < 0. Show that, if 


a > 1, then the series Tguetetin 


converges. 


4. Investigate the convergence of the series whose nth terms are 


(2n'F a. (a!) Bm)! 
(i) (4n)! 30"... (i) (4n)! 10”. 


5. Show that the complex series 
n! 


py (1 —in)” 


converges absolutely. 


6. Find all the real values of x for which the series &x”/n* converges. 


7, Find all the pairs of complex number a, b for which the series Xa”’b" converges. 
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4.4. Series of positive terms 

In this section all series have positive terms. Although theorems 
4.31 and 4.32 are useful for such series, more delicate tests are 
available. 

The proofs of the last two theorems use a comparison principle, 
introduced in C1, in which the terms of a given series are compared 
with those of a series whose convergence or divergence is known. 
We may also compare ratios of terms. 


Theorem 4.41. If, for alln > N, 


Unii — Unit 
a eee 


Un Un 


and Xv, converges, so does Xu,. If the opposite inequality holds and 
xv, diverges, then Xu,, also diverges. 


Proof. The statement about convergence follows from the relation, 
forn > N, 
u u v v u 
a ee ee at ee 
Uy = Un-1 Uy (Un-1 UN 


To apply comparison principles we need a stock of standard series 
which are known either to converge or to diverge. Such a stock can 
be obtained from the theorem (C1, 141) that if, on [a, 0), fis a 
positive decreasing function, then the integral and the series 


| “fodds, Eft) 


both converge or both diverge. 
For the next theorem only, write log, x = log (log,_,x) (r 2 2), 
where log,x = log x. 


Theorem 4.42. (The logarithmic scale.) Each of the series 


] ] ] 
De n(log n)*’ 9 2 nlogn log, n...log,_, n (log, n)*? “"" 


converges if k > 1 and diverges if k < 1. 


Proof. It must be supposed that the summation starts with a value 
of n (say a) large enough for the repeated logarithms to be defined. 
The value of the associated integral 


| Be 
a x log x log, x...log,_, x (log, x)* 
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(log, |. 

i [oe —| fk +1 
x 

and is loge. x| ith = ‘f, 


As X > o, the integral tends to 0 if k < 1 and to a finite limit if 
ke 1] 


The next theorem arises from the application of theorem 4.41 to 
the first two series of the logarithmic scale. 


Theorem 4.43. (Ratio test of Gauss.) Jf it is possible to write 


where yt is a constant, then Xu, converges if 4 > 1 and diverges if 
"es © 


Proof. Suppose # > 1. Let 1 < q < yw. Let v, = n-4. Then, by 
exercise 4(a), 5(1i), 


Ye = (MH 14 24.0(1), 
nN n 


Un41 


The inequality, for all sufficiently large n, 
Un Pn 
Unit = Unt 
gives the convergence of 2w,,. 
Similarly, if ~ < 1, Xw,, diverges. 
How =f, cefiine 


l 
v, = ——.. 
nlogn 
anf 
ee ( +) [1428 (l+n 4) 
ee n log n 
eh nee eer ye) 
n nlogn n 
For all sufficiently large n, 
Un, e., 


Therefore Xu, diverges. | 
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The convergence tests of this section and the last are all based on 
comparison with standard series. No such test can be universal; for, 
given any convergent (divergent) series, there always exists a series 
that converges (diverges) more slowly. This fact is illustrated by 
exercises 7-10 below. 


Exercises 4(d) 


1. Another comparison principle. Show that, if Xu,, Xv, are series of positive 
terms and = a 
0< lim inf < lim sup ~ <0; 


then =u, and =v, either both converge or both diverge. 


2. Investigate the convergence or divergence of the series whose nth terms are 


(i) n-"(log n)~, Gi) Geen) Ve”, 
(iii) (log log n)~!8", (iv) (log n)—!08 ls”, 
| 2n+1 
—1—(1/n) ; se Ree ee 
(v) an ; (vi) 1—nlog aE 
.. (N+a)" ay (S-) 
(vii) aes (a real), (viii) ae Ss : 


(ix) ja’*-1|* @>0), Gy G*-—T)*. 
3. Show that the series 


z (1- 


converge fork > landdivergefork < 1.(Compare with &n~* and =n-'(logn)-*.) 


a ") a. (1-8 = ee") 


4. Find all the positive values of a for which the series 


> (an)" 
n! 
converges. 


5. The condensation test. The sequence (u,) is positive and decreasing and p is an 
integer greater than 1. Prove that the series 
oe) co 
Su, and >, psa 
1 1 


both converge or both diverge. (Group the terms in <u, appropriately; it is 
advisable first to consider the case p = 2.) 
Deduce the logarithmic scale. 


6. Let =u, be a divergent series of positive terms and let s, = u,+...+Un. Show 
that the series P 


converges if k > 1 and diverges if kK < 1. [For convergence prove that, when 


kK <1, a oe ( po 
— << -—— eis pete ecemns - 
oe bi es re) 
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7. Given a convergent series Xu, of positive terms, construct a convergent series 
Xv, such that v,/u, -> ©. 


8. Given a divergent series Xu, of positive terms, construct a divergent series 
Lv, of positive terms such that v,/u, > 0. 


9. Given an increasing sequence (g,) with limg, = 0, construct a convergent 
series Xu,, such that &uw,,q, diverges. | 
10. Let Uy tUygt gt .-.., 

Ug, + Ugg + Uvg 5 eoey 


Us; + Ugg + Ugg + esey 


be an infinite scale of divergent (convergent) series of positive terms each 
diverging (converging) more slowly than the one before in the sense that, for 
EVEry FL, Urs3,n/Ur.n > 0 (00) as n> co. Construct a series which diverges (con- 
verges) more slowly than every series of the scale. 


4.5. Conditionally convergent real series 


A conditionally convergent series is one that converges, but not 
absolutely. It was shown in C1 (93) that rearrangement of terms does 
not affect the sum of an absolutely convergent series, but may change 
the sum of a conditionally convergent one. A more precise statement 
is possible. 


Theorem 4.51. (Riemann.) Let u,, be real and Xu, conditionally con- 
vergent. Then the terms of the series can be rearranged so that the sum 
of the first n terms tends to any given limit or has assigned upper and 
lower limits (finite or infinite). 


Proof. Denote the positive terms in Xw,, in the order in which they 
appear by vj, v,,... and the negative terms by w,, w,,.... Then the 
series Lv,, Lw, both diverge; for if both converged, =|u,| would 
converge and, if one converged and the other diverged, Xu, would 
diverge. 

We shall rearrange the series to converge to a given sum f¢. Take 
the smallest number of v’s for which v,+v,+...+v, 2 t. Then take 
for the next terms of the rearranged series just enough w’s, starting 
with w,, to make the total sum < ¢. Then take more v’s in order 
from v,., to make the sum 2 f. 

Since Xv, and Xw,, diverge, this process can be continued indefin- 
itely. Let t,, be the sum of the first n terms of the series so constructed. 
To prove that ¢, >t we remark that |t—t,| < |u,,|, where u,, is 
the final term of the last complete block of positive or negative terms 
included in ¢,,; and u,, >0asm— oo. 
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In view of theorem 4.21 corollary 2 it is easy to modify the process 
so that (¢,,) has any assigned upper and lower limits. | 


The tests of the last two sections are used for proving absolute 
convergence of series whose terms are not all positive. Theorems 4.53 
and 4.54 below are often useful for establishing conditional con- 
vergence. These theorems depend on the following simple identity. 


Theorem 4.52. (Partial summation.) Jf ay, ..., Gp, Vy, ..., Vp are any 


ke 
numbers and s, = > a; (k = 1,..., p), then 
i=1 


AyVy + 0. FA gVy = 54(01— V9) +... +Sp—1(Up—-a = Up) ESD pe 


Proof. Each side is 5,0, + (S2—51) 02+... +(Sp—Sp-1) Vp. | 


Corollary. If v, > ... > Vp > 0 and ay, ..., ay are any real numbers 


such that i kee eS A | el, 


then [Vy < Ay +... +A, V, < Mr. 

Proof. Since v1 — V9, ...5 Up-1 — Ups Vp are Non-negative, 

SO, Dg +. FS 0p) Fo 

lies between the sums obtained by replacing each s, by w and M 
respectively. | 
Theorem 4.53. (Dirichlet.) Let a,, v, (n = 1, 2, ...) be real. If 

(i) the sequence (s,,), where s,, = = a,, is bounded, 

(ii) the sequence (v,) is alist and tends to 0, 
then Xa,V,, converges. 


Proof. We may suppose that (v,,) decreases, so that v, > 0. By (i), 
there is an H such that |s,| < A for all. Then 


Therefore, by the last corollary, 
|Ant1Pntt +... Bn, pOiap| - 2H 741 


for all p > 1. Since v,, + 0, the sequence (t,,), where 
n 
,, = ya AV ps 
r=1 


is a Cauchy sequence. As R! is complete, lim £, exists, i.e. 2a,v, 
converges. | 
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9 alternating series theorem (C1,91) is the special case 
= (—1)” of theorem 4.53. 


Theorem 4.54. (Abel.) Let a,, v, (n = 1, 2, ...) be real. If 
(i) Xa, converges, 
(ii) the sequence (v,,) is monotonic and bounded, 

then Xa,,V,, converges. 


Proof. We can prove, as in theorem 4.53, that the partial sums of 
xa,v, form a Cauchy sequence. Alternatively we can deduce 4.54 
from 4.53 as follows. 

From (ii), v, tends to a limit, say v. Let v, = v+w,, so that the 
sequence (w,) is monotonic and tends to 0. By (i), 4a,v converges. 


n 

Also (s,,), where s, = > a,, is bounded and so, by theorem 4.53, 
r=1 

xa,W,, converges. | 


Example. If (v,) is monotonic and tends to 0, then Xv, sin nx converges for 
all x and Xv, cos mx converges except, possibly, for x = 2km7 (kK = 0, +1, 
a2. ..). 

When x + 2k7, 


> ae en cos 4x—cos (n+ 4) x ve 

pack 2 sin $x |sin 4x| 

ae sin (n+ 4) x—sin 4x 1 
and cos rx} = |——+——_ | < > 

2 2 sin 4x |sin 4.x| 


and the result follows from theorem 4.53. 

When x = 2kz, sin nx = 0 for all n and so Xv, sin nx converges; cos nx = 1 
for all n, so that Xv, cos nx is Xv,, which may or may not converge. 

This example is frequently used in discussing the convergence of a power 
series at points on its circle of convergence. 


Exercises 4(e) 
1. Examine for convergence the series whose nth terms are 


®@ CI @>0, @ ae. 
c zs (iv) — (n > 1), 
(v) (vi) (a¥"—1) cos nx, 
(vii) (—1)"n%erV®, a ae 
pe OE a sAr & 00s na (% > 0). 


log n 
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2. Find all the real values of x for which the series 


: 1 1 
(i) 2 lat Det @eDt — +o sin nx, 
_ 1 1 1 
A Pee ee ae 
(ii) is fen D?* (nt ae eax +5 sin* nx 
converge. 
3. Show that the series > ( | ++ oa +) — 


converges absolutely for x = k7(k = 0, +1, +2, ...) and conditionally for all 
other (real) values of x. 


4. The series Xa, converges conditionally. Show that 
(i) 2Xa,n* diverges if k > 1, and 
(ii) if =(a,,/n*) converges, so does L(a,,/n*) for B > a. 


Deduce that there is a real number a, > —1 such that X(a,/n%) converges for 
a > a&, and diverges for @ < Gp. 


5. The sequence (a,,) is monotonic and converges to the limit /. Prove that the 
sum s,, of the first 7 terms of the series 


a (—1)"a, 
n=1 
is such that lim sup s,—lim inf s, = |/]. 


6. Establish Dedekind’s extension of Abel’s theorem 4.54 that, if both the 
series La, and X|v,—vn,;| converge, then La,,v, converges. State and prove a 
corresponding extension of Dirichlet’s theorem 4.53. 

The real function f on [1, ©) is such that f’(x) > 0 for x > 1 and f(x) > 0 


as x > 00. Prove that f(x) > 0, f(x) < 0 for x = 1, and that >) f’(m) converges. 
1 


Use the mean value theorem to deduce the convergence of 
~ (—1)"f() sin {log f()}. 


7. The series Xu, converges and the series Xv, of positive terms diverges. Show 


that lim inf (u,/V,) < O < lim sup (uy/v,). 


Find pairs of series Xu,, Xv, such that 
(i) lim inf (u,/v,) = —, lim sup (4,/v,) = ©; 
(ii) u, > O, lim inf (u,/v,) = 0, lim sup (u,/v,) = ©. 


co 


foe) 
8. The series >) u,, converges absolutely and the series >) v, converges. Let 
0 0 


Wa = UgGUntUyUnrzt...-+lndo (2 = 0,1, 2, ...). 


nT n 
Prove that y, We = DG hs 
r=0 


r=0 
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oO io @) 


where ¢ = >) v,,7, = > v,, and deduce that 
0 n+1 


$2 v= (3 un) ( & on) 
n=U n=0 n=0 
(In theorem 5.7 of C1, Xu, and Xv, are both assumed to be absolutely con- 


vergent and 2w, is proved to be absolutely convergent also.) 
Construct conditionally convergent series Xu,, Xv, such that =w,, diverges. 


9. The terms of the series 1—4+4-44... 


are rearranged according to the following rules. 


(i) The order among the positive terms and the order among the negative 
terms remains unchanged. 


(ii) If, of the first n terms, p, are positive and q, are negative, p,,/g, > k (> 0) 
as n > ©, 


Prove that the sum of the series is 4 log 4k. 


10. The sequence (a,) is decreasing and a, ~ //n. Show that, if the terms of the 
series 


Got Gg — Gy t oc 
are rearranged as in 9, then the sum of the series is increased by 4/ log k. 
11. For what values of & is the series 
SS ee Se ee Ee 
See 


with a negative term following two positive ones, convergent? 


carr 


4.6. Power series 


In Cl, §5.5 and §5.6, it was shown that every power series Za,,z” 
has a radius of convergence R and that R = lim |a,/a,,,|, if this 
limit exists. By using upper limits we can give a simpler proof of the 
first result and we can sharpen the second. 


Theorem 4.61. 
(i) Zflim sup |a,|"”" = 0, then Xa,,z" converges absolutely for all z. 
(ii) Zflim sup |a,,|/" = 00, then Xa,,z” converges for z = 0 only. 
(iii) Jf0 < lim sup |a,|"" < oc and 
2 l 
~ jim sup |a,,|1”’ 
then Xa,,z" converges absolutely for |z| < Rand diverges for \z| > R. 
The radius of convergence of Xa,,z" is therefore R, where R is called 
00 in the case (1) and 0 in the case (1i). 
Proof. 
(i) For every z, lim sup |¢,2"|"" = |z| lim sup |ja,|"" = 0 and 
so, by theorem 4.31, Xa, z” converges absolutely for all z. 
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(ii) For z + 0, lim sup |a,z"|¥/" = |z| lim sup |a,|/" = 00 and so 
Xa,z” diverges for all z + 0. 

(iii) If |z| < R, then lim sup |a,z"|¥" = |z| lim sup |a,|1 < 1 
and so a,z” converges absolutely. 

If |z| > R, then lim sup |a,z"|/" > 1 and so Za,z” diverges. | 


Theorem 4.62. The power series Xa, z" and Xna, z"—1 have the same 
radius of convergence. 
Proof. By exercise 4(5), 6 
lim sup |na,,|"" = lim sup |a,|¥”, 
since lim nV” = 1, | 


Corollary. 

(i) All the series obtained by differentiating a power series term by 
term any numbers of times have the same radius of convergence. 

(ii) A similar result holds for integration. 


Exercises 4(f) 
1. Find the (complex) values of z for which the following power series converge. 
@ > Ac : *) : Gi) Enter 
GO Bae 6) eee. 
W) Lava w) 3 (SS). 
(vii) > 2 *) z” (areal), (viii) ) — a a 
(ix) SRI r( = is in)? (x) yee (a real). 


2. Show that the binomial series > Fl z” (a real) has radius of convergence 1 
n=0 
unless & is a non-negative integer (in which case the series terminates). Prove also 


that 
(i) when « > 0, > (") z" converges absolutely for |z| = 


(ii) when -1 <a < 0, (") z" converges conditionally for |z| =1,z +—1 
and diverges when z = - ® ) 
(iii) when e < —1,)>) (* ") z” diverges for |z| = 
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(To prove (ii) show that 


CO) ets) > lorie © 


if k is sufficiently small and deduce that (") >0Oasn—> oo.) 


3. Investigate the convergence of the hypergeometric series 


> a(at 1)..4e+A=1) 06+ 1),..(6+n—1) °° 
mat n! c(c+1)...(c+n—1) ; 


b 


where a, b, c are real numbers not equal to 0, —1, —2,.... 


co oO 


4. The power series }) a,z", >) b,z” have radii of convergence R, S respectively. 
0 0 


(i) Show that the radius of convergence T, of X= (a,+5,)z” is min (R, S) if 
R + S. What can be said about 7, when R = S? 

(ii) Show that the radius of convergence T, of 2 a,,b,,z" is at least RS (where 
Roo is interpreted as oo if R > 0). Construct an example in which 7, > RS. 

(iii) Show that the radius of convergence 73 of & (@yb, +, bn_1+...+@nb9)z” 
is at least min (R, S). Construct an example in which T; > min (R, S). 


5. The sequence (dp, @;, de, ...) of non-negative real numbers is decreasing and 
ay > 0. Show that the radius of convergence of 2 a,z" is at least 1. 
Prove that, if |z| < 1, then 


| (1 -(¥ a2") —Ap 


ioe] 
and deduce that }} a,z” + 0 for |z| < 1. 
0 


< Apo 


4.7. Double and repeated limits 


Let X be any set. A double sequence of elements of X is a function 
whose range is in X and whose domain is P?, the set of ordered pairs 
of positive integers. The members xX, (m = 1, 2,...3 mn = 1, 2,...) 
of a double sequence may be arranged in an array 


We shall only consider double sequences of real or complex numbers. 


Definition. The complex double sequence (Xm) is said to converge 
to the number x if, given e > 0, there is an Ng such that 


\Xmn—X| < € whenever m,n > nN; 
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we then write 
Xeon OX GS in, uO of Tih xX = =x. 
M,N 


Illustrations 


(i) oe >Oasm,n—>o., 


m+n 
= : = aaa. 
Gi) = lim does not exist; for ~ for all m and -——— = -. 
m,n—>o N+Nn mtm 2 m+2m 3 


It is natural to try to relate the double limit lim x,,, with the 


mM, NO 


two repeated limits lim (lim Xmn) and lim (lim Wieck 


TiS Nh nO mMm—- 


Theorem 4.71. If lim Xn, exists and, for each m, a: Xopin Oris, 
Mm, N—>O 
then lim (lim x,,,) exists and is equalto lim Xn. 


mo n>. M,n—-2 


Proof. Let Ame X= x and ia Xnin = Vo, (R= 1,2, carven 


Mm, N->O 


€ > O, there is an N such that 

\Xmmn—x| < € for m,n> N. (4.71) 
Also, given m, there is a P(m) such that 

\Xmn—Ym| < € for n> P(m). (4.72) 


Now take m > N. If n > max (N, P(m)), (4.71) and (4.72) both hold 
and give 
lY¥m—x| < 2e. 


Thus vy, > xX asm-— oo, | 


Note that the existence of lim x,,,, does not ensure the existence 


m, N—>O 


of lim x,,, for any m (cf. x,,, = (—1)”/m). 


no 


Theorem 4.71 has an immediate corollary. 
Theorem 4.72. Suppose that (i) for each m, Los Xmn exists and, for 


each n, iim Keng CXistS; Ve Xig, CXISES. T. tee lim (lim x,,,,) and 


Mm, n>2 mo nO 


lim ins 45 exist and are equal. 


nO ™M—>O 


Illustration (ii) shows that lim (lim x,,,,) and basi (lim Xn) May 


m—>o n> —>o m—->o 
well exist without being equal. The inversion Ps successive limiting 
operations is generally a delicate matter; there is one simple criterion, 
of monotonicity, which we now give. 
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Theorem 4.73. Suppose that each Xin is real and that, for each m, 
Xn increases (decreases) withn and, for each n, Xm, increases (decreases) 
with m. If one of the limits 


lit Rees tik GM Xx)s pn ee Miu) 


Mm, N—->O moO N->0 —>eo 


exists, then the other two also exist and all me limits are equal. 


Proof. We may assume that x,,, increases with m and n. 

(i) Suppose that Xx», > x as m,n — oo. Then Xm, < x for all m,n 
Hence, for each ™, (Xi; Xm. ---) 1S a bounded increasing sequence 
and so lim x,,, exists. Similarly lim x,,, exists. It now follows from 


no mM >o 
theorem 4.72 that lim (lim x,,,,) and lim ( lim Xmn) exist and are 
mean nO nD Mm, 
equal to x. 


(ii) Suppose that lim (lim x,,,,) exists. Let 


m—00 nD 


a me et = 1, 2, ..5) 


nro 


and let lim y,, = y. Since Xm», increases With 1, Xin < Ym for all 


Mm >O 
n; and since x,,, increases with m, y,, increases with m also and so 


for all m,n. It is easy to see that a bounded double sequence which 
increases with each index converges (exercise 4(g), 5). Therefore 
lim Xn exists. By (4), am. Chim. Xmn) also exists and all three limits 


m, N>O 


are equal. | 


The notion of a double limit is applicable to functions on R? (i.e. 
functions of two continuous real variables). For a given function /, 
we may consider f(x, y) as x, y tend to infinity or to a finite limit. 
Also one variable may be continuous, the other confined to integral 
values. In all these cases analogues of the last three theorems hold. 


Double series. Given the double sequence (u,,,) of complex terms, 


= > > Us; (m,n = ED ees 


t=1j=1 


let 


If S1n —> 5 aS M, Nn > 00, we say that the double series 


converges to S. 
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Theorem 4.71, interpreted for double series, takes the following 


form: If > Umn converges to s and, for each m, > Umn CONVErZES, 


m,n=1 n=1 


then s (> x Una) converges to s. We use the series form of theorem 


m=1 \n=1 


4.73 to prove a more general result. 


Theorem 4.74, Let (Umn) be a double sequence of complex terms. If 
one of the series 


Eble 5,(3 bes EE, al) 9 


converges, then all the series 
ere oe 
m,n=1 m=1 n=1 =1 
converge and have the same sum. 


Proof. 
(i) First suppose that the w,,,, are real. Let 


so that Dn» and dnp are ee in. = po. ane 


O< Pins Inn < oe 
If now one of the series (4.73) converges, so does the corresponding 
series with |u,,,| replaced by Pn» (by the comparison principle or its 
extension to double series—see exercise 4(g), 6). Then, by theorem 
4.73, 00 90 <0) o) 
2 Pmns ® Pnn) p> & x Pan) 
m, n=1 m=1 \n=1 n=1 
all converge and have the same sum. This is also true for the corres- 
ponding series of g’s and so subtraction gives the result for series of 
real terms. 
(ii) When the u,,,,, are complex, let 


Unn = Unn +lWmns 
where v,,, and w,,,, are real. Since 
lOmnls |[Wmnl < [Umals 
if one of the series (4.73) converges, then the corresponding series 
with |u,,| replaced by |v»,| and |w,,,| also converge. Hence, by (i), 


the real and imaginary parts of u,,, have the desired property and 
this proves the theorem. | 
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Exercises 4(g) 
1. Find, when they exist, the double limit lim x,,, and the repeated limits 


Mm, W—->oO 
lim (lim Xmn), lim ( lim Xm) of the sequences (X»,) with the following 


m—>o nO n—->oOo MmM—->@O 


(m, n)th terms. 


; —n .. mtn 

() m+n? (a) mm? 

= : 1 

Gi) Gv) (7 +7), 
mn . m =! G 

ra a Ss eee 7 (142). 


2. Give an example of an unbounded, convergent double sequence. 


3. Prove, without using double limits, that if x,,, increases (decreases) with 


each of m and n, then the existence of one of lim (lim x,,), lim (lim Xp) 
mao nN—->o no m->O 


implies the existence of the other and also the equality of the two limits. 


4. General principle of convergence. Show that a necessary and sufficient con- 
dition for the (complex) double sequence (x7) to converge is that, given e > 0, 
there is an my such that |Xnn—Xmn'| < € if m, m’,n,n’ > No. 


5. The double sequence (x,,,) of real numbers is bounded and Xm, increases 
(decreases) with each index. Prove that (Xm) converges. 


6. Comparison principle. Suppose that >) umn. >) Umn are double series of 


m, 7 m,n 
positive terms, that umn < Umn for all m,n and that >) vm_, converges. Show 
m,n 
that >) umn converges. 
m,n 


7. The double series }) umn of complex terms is said to converge absolutely if 


m, TN 
> |Unn| converges. Show that an absolutely convergent double series converges. 
m,n 


8. Show that the double series >, one 
m, n=1 m*n' 
converges if and only if k > 1 and/ > 1. 
9. Show that the double series 3 
S 1 
mone 1 (m+n)* 


converges if and only if k > 2. 


s ; 
10. The double series >) wm, converges absolutely and, for r = 1, 2,..., 
m,n=1 
vr = D> Umn = Uy, 7-1 + Ug, pot... F Up—1,1- 


M+n=Pr 
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Prove that PS Se epee ee 
Deduce that, for k > 2, 


= 1 a 
oe eet 
m, n=1 (n+n) n=1 


11. Let w, w’ be non-zero complex numbers such that w/w’ is not real. Show that 
the double series = 1 


m, n=1 (mo+ nw’) 


converges absolutely if and only if k > 2. (For the present, k must be rational; 
general complex powers are introduced in chapter 10.) 


fe @) 
12. Prove that the double series >) 2z”” converges absolutely for |z| < 1. 


m,n=1 
Hence prove that, when |z| < 1, 
alee z™(1+2") 


Parse” - gant 


fo @) 
13. The series 2; Gat” ond. >, 6.2" 
n= 


both have radius of convergence R(> 0). Their sums are denoted by f(z) and 
g(z) respectively. Establish, in a disc to be specified, the formula 


x a,2(2") = x b, f(z"). 


NOTES ON CHAPTER 4 


§4.1. The first mathematician to use the O,o notation systematically was 
FE. Landau (1877-1938). P. Du Bois-Reymond (1831-89) had previously used a 
more elaborate notation (which included ~) for comparing the rates of growth of 
two increasing functions which tend to infinity. (See G. H. Hardy, Orders of 
Infinity (Cambridge, 1910).) 


§4.2. The upper and lower limits of a sequence of real numbers could be 
defined as the largest and least numbers, respectively, to which subsequences 
converge (with suitable provision for infinities). However this procedure would 
require an initial proof that such numbers exist. The method we have adopted, 
though less intuitive, is technically simpler. 

Filters. In analysis the idea of a limit appears in many forms each of which is 
usually treated individually. In this book we have separately defined limits of 
sequences in metric spaces and of functions from one metric space into another, 
and other instances occur in C1. This approach, apart from being traditional, is 
best suited for an introductory account and we have again adopted it for defining 
the several different guises in which upper and lower limits appear. However a 
unified treatment of the limit process is possible. 

Let X be a non-empty set. A non-empty set ¥F of subsets of YX is called a 
filter on X if the following conditions are satisfied. 
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Fl. If A, Be F, then AN Be F. 

F2. if Ac Fand B >A, then Be FJ. 

F3. o2€¢F. 
Note that, by F2, X ¢ F. It is easy to give examples of filters. 

(i) In P, the set of positive integers, the sets A such that P—A is finite form 
a filter. 

(ii) In R}, the collection of all sets A which contain an interval (x, 0) is a filter. 

(iii) Let (Xo, p) be a metric space, let a be a fixed limit point of X, and let 
X = X,— {a}. A filter on X may be defined as the collection of all subsets A of X 
which contain a set B(a; r)— {a} for some r > 0. 

(iv) Let F be a filter on a set X. If Ae F, denote by the subset of F 


defined b 
: SP ={EEF\|E> A}. 


Then % ={B- F\ZB > A for some Ac F} 


is a filter on F. 
Now let ¥ bea filter on a set X and let (Y, 7) be a metric space. We then say 
that 
lim f(x) = Yo 
F 


(where y, € Y) if, given ¢ > 0, there is an A €¢F such that o(yo, f(x)) < € for 
allxeA. 

Clearly the filter in (i) yields the limit of a sequence of points in (Y, 7); and it 
is easy to see that (ii) and (iii) lead to other familiar situations. For instance if, 
in (iii), X = (a, ©) or (—©, a), we obtain a one-sided limit at a. 

By formulating the limit concept in such generality one sacrifices, not sur- 
prisingly, some of the properties traditionally associated with limits. However 
these may be restored by the imposition of quite mild restrictions on Y. 

Finally, let f be a bounded, real-valued function on a set X equipped with a 
filter A. Then the relation 


P(A) = sup tts) (Ae F) 


gives a bounded real-valued function ¢ on ¥ and it may be shown that lim ¢(A) 


F 
exists, where % is the filter on F defined in (iv). This limit is defined as the upper 
limit of f with respect to F, i.e. 


lim sup f(x) = lim (sup f(x)); 
F FG zed 
and similarly lim inf f(x) = lim (inf f(x)). 
F § xed 


In this sketch we have not considered infinite limits of real-valued functions. 
These present no problems of definition, but they are most simply incorporated 
in the theory of filters when topological rather than metric spaces form its back- 
ground. A simple account of filters in this setting is given by F. Smithies in his 
expository article ‘Abstract Analysis’ (Mathematical Gazette, 35 (1951), 2-7). 
The equivalent theory of directed sets is also mentioned in this paper. Still another 
general theory of limits is described by E. J. McShane in Studies in Modern 
Analysis (edited by R. C. Buck). 
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§ 4.3. In C1 and so far in this book no formal definition of the infinite series 
> u,, has appeared. To fill this gap we write 
: Sy = Uyt...+Uy, (n = 1) 
and say that the series is the ordered pair of sequences ((u,), (Sn)). 
There is no confusion in practice in the double usage of the symbol s we. 


first as designating the series formally defined as ((u,), (s,)) (without any implica- 
tion as to its convergence), and secondly as the number which is the sum of this 
series when it does converge. 

Similar remarks apply to double series. 


rs) 
UNIFORM CONVERGENCE 


5.1. Pointwise and uniform convergence 
Consider a sequence of functions 


fie Yo fe = 1,2, 2), 


where (Y, o) is a metric space. For the moment the space X need not 
carry a metric. 
Suppose that there is a function f: ¥ > Y such that, for each x 


ag Ere ee, 


as n > co. We then say that (/,,) converges to fon X, or, more speci- 
fically, (f,,) converges pointwise to fon X. 


Illustrations 
In each of these, X is a subset of R! and (Y, c) is R! with its usual metric. 


nx 
(iis te ice for x > 0. 


Ir{x) > X. 
di) f.0) = 3" for Q <= xe 1. 


f(x) > f(x), where f(x) = 0 for 0<x<1 and f(1) = 1. 


= nx 
Gi) f,(x~) = a for x > 0. 


f,(x) > 0. 


We observe that in (ii) the sequence of functions f,, each con- 
tinuous on [0, 1], has a limit function which is discontinuous at the 
point 1. In this example pointwise convergence is not a strong 
enough requirement to ensure that continuity is carried over from 
the functions f,, to the limit function f 

The central theme of this chapter is the transmission of properties 
of the individual functions of a sequence to the limit function. For 
this investigation we define a mode of convergence of a sequence 
of functions, stronger than pointwise convergence. 
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Definition. Suppose that f,,: X > Y is a sequence of functions on a 
set X to a metric space (Y, 7). The sequence (f,,) is said to converge 
uniformly to f: X > Y if 


i o(f,(x), f(x)) >~0 as n>o., Ca 


The distinction between pointwise and uniform convergence is then 
that, in the former o(f,,(x), f(x)) tends to 0 asm > oo for each separate 
point x and, in the latter, the supremum of all these values tends to 0. 
It is plain that uniform convergence implies pointwise convergence 
(to the same limit function). Illustration (ii) shows that convergence 
may hold at every point of a set, here [0, 1], but since, for every n, 


SUP. fin) —f(x)| = I, 


the convergence is not uniform. 

The e-N criterion for the condition (5.11) is often useful. Converg- 
ence for each separate x in X means that, given e > 0, there exists 
N = Ne, x) such that o(f,(x), f(x)) < ¢ for all n > N. The con- 
vergence is uniform if and only if it is possible to choose N = N(e), 
independent of x, such that 


o(f, (x), f(x) < ¢ foralln 2 Nand allxe X. 


We shall examine the illustrations (i), (ii), (iii) for uniformity of 
convergence. Since XY < R! and Y = R!, they lend themselves to 
graphical representation. If f, > /f uniformly on X, the graph of 
y =/f,(x) for all m greater than N(e) lies inside the strip of the (x, y) 
plane between the curves y = f(x)+e and y = f(x)—e. The fact 


Cy ate 
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that pointwise convergence does not imply uniform convergence 
means that the eventual nearness of f,,(x) to f(x) for each individual x 
does not imply the eventual ‘global’ nearness of the curve y = /,,(x) 
to the curve y = f(x). 

In (i) above, f,(x) —f(x) = —x?/(n+.x). In any finite interval, say 
(0, a], 


sup |f,(x)—f(x)| < “>0 asi n>, 

0<a<a nN 

and so f,, > f uniformly on [0, a]. 
Suppose, however, that x is free to take any value in [0, 00). Since 


sup Ful) —F@) 


the convergence is not uniform in the infinite interval [0, 00). 

(ii) We have already shown that the convergence of (x”) is not 
uniform in [0, 1]. This fact is also intuitively clear. The limit function 
fhas a jump of amount | at the point 1 and so, if < 4, none of the 
continuous curves y = x” can lie in the two disconnected pieces 
represented by f(x)—€ < y < f(x)+e(0 < x < 1). It will be proved 
in theorem 5.21 that any sequence of continuous functions with a 
discontinuous limit must converge non-uniformly. 

The reader should prove that in any interval [0, 1-0] which 
excludes the point 1 the convergence of x” to its limit is uniform. 

(iii) This example is more subtle than (i) or (ii). The function f, 
has a maximum at the point 1/n and f(1/n) = 4. We see that 


sup f(x) —f(2)| = $ 


and the convergence is not uniform. 


= 0, 


= 


n=z8 


oS 
vl 
he 

- 

al 
Qo! 

| 
— 
~ 
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Each f,, is continuous everywhere in [0, 00) and so is f. Thus uni- 
formity of convergence is not a necessary condition for the limit 
of continuous functions to be continuous. 


Another example exhibits the notions of pointwise and uniform convergence 
in a more abstract setting. 

Let C be the set of continuous and bounded real functions on R? and let p be 
the metric defined by 


rth 
= sup | [an a 


0<h<l 


(¢, ve C). 


(It was shown in exercise 2(a), 2 that p is a metric.) 
For n > 1, we define the functions f, : C > C by 


FP) = (x47) eC, xe RY. 
Now take any ¢ € C and let Ky = sup |d(x)|. We have 


Io 3) 9a 


PAP), $) = sup 


—-o<7< @ 
0<h<l 
c2t+h+(i/n) xz+(1/n) 
=  s$up [ d(t)dt— [ p(t)dt 
—o<7<O | JIrt+h x 
0<h<l 
2 
< —K,>0 as n->©O. (5.12) 


So, for all de C, f,(¢) > ¢ as n> ©; in other words, f,, > J pointwise, where I is 
the identity function on C (i.e. J(¢) = ¢ for all Ge C). It also follows from 
(5.12) that, if K is any positive number and Cx is the subset of C consisting of all 
@ such that Ky < K, then 


sup p(f,(9), 1(¢)) < — K. 
PECK 


siN 


Hence f,, > J uniformly on Cx. 


We add three further notes. 
(1) In the two paragraphs preceding theorem 3.44 the usual metric 
in the space B(X, Y) of bounded functions f: X¥ >(Y,o) was 


defined by PCF, 8) = sup o(f(>); 8(*)). 


Thus, for functions in B(X, Y), uniform convergence is identical 
with convergence in the metric space (BCX, Y), p). 
If, in particular, Y is R', then 


pf 8) = sup [/(x)—2()] 


and, for bounded real valued functions, uniform convergence is the 
same as convergence in the metric space (BCX), p). 
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(2) We have assumed the set of functions on X to Y to be indexed 
by a positive integer n which tends to oo. It is easy to adjust the 
definition of uniform convergence to cover, say, a double sequence 
indexed by (m,n) or a set of functions determined by a continuous 
variable ¢ which tends to a limit f, or to ©. 

(3) Convergence and uniform convergence of series. 

A statement about convergence ofa series ») u, of real or complex 


n=1 


valued functions on a set X is a translation of the corresponding 


n 
statement about the sequence s,, where s, = > u,. 
r=1 


Illustration. For n = 1, let u, be defined on the set of complex numbers z 


satisfying |z| < 1 by uz) = ze 


1-—z 1 
= ets = s(z), say, 


cO 
and >) u, converges to s at each point of the disc |z| < 1. 
1 


Then S,(Z) = 


The convergence is not uniform because, for every n, 


Blo OE 


sup |s,(z)—s(z)| = Sup 
\2| era A 


The convergence is, however, ae on |z| < 1—0, where d is any fixed 
positive number. For we have 


= (1 —3)” 
6 
Theorem 5.41 will show that any power series converges uniformly on a closed 
disc inside its circle of convergence. 


>0O as n->o. 
1-—z 


up 
jzi<1—6 


Exercises 5(a) 


1. The functions f,, fon XU Y are such that f, > f uniformly on X and f, ~ f 
uniformly on Y. Show that f, > funiformly on XU Y. 


2. If complex valued functions f,, g, tend respectively to f, g, uniformly on X, 
show that, for any constants «@, f, af,+fg, >af+fg uniformly on X. Is it 
necessarily true that f, g, ~ fg uniformly on X? 


3. The sequences (f,) on (0, 1], (g,) on (0, 1), (A,) on [0, 100] are defined by 


sin nx 


IO cee oa. En(X) _ nx ’ h,(x )= 


mo 


Prove that all three sequences converge pointwise; which of them converge 
uniformly ? 


4. Examine for uniformity of convergence on [0, 1] the sequences (f,), (gn) 
defined b 2 
eos FiAx) = x"1—x"), en(x) = nxe”. 
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5. Let (r,) be a sequence consisting of all the rational numbers and, for 
n = 1,2,... define the functions f, on R! by 
jee ae gee ge 


f(x) = | 


0 otherwise. 
Prove that (f,) converges pointwise, but not uniformly on every interval of R’. 


6. Let Q be the set of rational numbers in (0, 1) and define the function fon Q 
by f(p/q) = 1/q when p,q are coprime positive integers. Let (r,,) be a sequence 
consisting of the members of Q and, for n = 1, 2,..., define f, on Q by 


_ (fQ) if xe tn, ..., Mas 
PS opting Oth, +005 Tn}. 


Show that f, > f uniformly on Q. 


7. Examine for pointwise and uniform convergence the sequences (/,,), (Zn), (An) 
of complex functions defined by 


sin nz 
eo 


=\", gz) =e”, hz) = 


8. It is given that, for every continuous real function f on [0, 1] for which 
f(O) = 0, the sequence (/s,,) converges uniformly on [0, 1]. Also lim_ s,(0) exists. 


n> 


Decide whether the sequence (s,,) must converge uniformly on (0, 1]. 


fla = 


9. Forn = 0,1, 2, ..., the functions up, Vn, Wn on [0, 1] are given by 
AX) 2 NL = xX), O,{x) = I= xP, a) = (Tee 


Prove that the series Xu,, Xv,, &w, all converge pointwise; which of them 
converge uniformly ? 


10. Let X < R* be the rectangle [a, b] x [c, d], where a > 0, c >0. The functions 
f,. X > R' are given by 


Show that (f,) converges uniformly on X. 


11. The set X consists of the real functions ¢ on [a, 5] such that on [a, 5] 
(i) 9’ exists and is continuous, and 


Gi) |S@d|, |¢’@)| < M. 
For n = 1,2, ..., the functions f,: X — R' are defined by 


f.(9) = [ d(x) sinnxdx (ge X). 


Prove that the sequence (f,) converges uniformly on X, 


12. Let (X, p), (Y, ©) be metric spaces. Let X, be a subset of X and x, a limit 
point of X>. Suppose that the functions /,: X > Y converge uniformly on Xo 
and, for each n, 
TAX or in as be Xo: 


Show that, if (Y, 7) is complete, then the sequence (y,,) converges. 
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5.2. Properties assured by uniform convergence 
Illustration (11) of §5.1 shows that the limit function of a sequence 


of continuous functions may be discontinuous. We now prove that 
this cannot happen when the convergence is uniform. 


Theorem 5.21. Let (X, p) and (Y, o) be metric spaces. Let the sequence 
of functions f,,: X > Y converge to f, uniformly on X. If c is a point 
of X at which each f,, is continuous, then f is continuous at c. 


Proof. (The argument is essentially the same as was used in 
theorem 3.45.) 
Given € > 0, there is m such that 


sup O(fu(x),S()) < €. 
Since f,, is continuous at c, there is a d > O such that 


o(f,.(x), f,(c)) < € whenever p(x, c) < 0. 
Hence, u oy, c) =< -6, 


O(F(X),f() < TF) Ful) + OT)» Ful) + 7, MOQ) < 3 


and so f is continuous at c. | 


Corollary. If the series Xu, of real or complex valued functions con- 
verges uniformly to s on (X, p) and if each u,, is continuous at c, then 
S is continuous at c. 


Proof. Each s, = u,+...+u, is continuous at c. | 


The hypothesis of uniformity of convergence which has been 
proved in theorem 5.21 to be a sufficient condition is not a necessary 
condition. This is shown in illustration (111) of §5.1. We shall prove 
in theorem 5.35 that, if Y is R! and if the sequence (/,,) is monotonic, 
then uniformity of convergence is necessary for the continuity of fC 

Repeated limit problems. In theorem 5.21 continuity of f at ec, 
when c is a limit point of X, means that 


lin f (x) = f(C). 


The left-hand side is lim lim f(x), where the inner limit is in the a 


x—->C N->D 


metric and the outer in the p metric. The term f(c) on the right-hand 
side is lim f,(c), and, if each f,, is continuous atc, this is the same as 
n—>oo 


lim lim f,,(x). 


nO @-c 
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Hence the conclusion of theorem 5.21 is that in whichever order 
the two limiting operations 
lim and lim 
no XL—>C 
are applied to f,(x) the result is the same. 

The problem of inverting the limits m > oo and n > oo in a double 
sequence has already been discussed in §4.7; the solution there lay 
in the sequence being monotonic in m and in n. 

A common application of the notion of uniformity is to justify 
the inversion of the order of repeated limits. Theorems 5.22 and 5.23 
will provide further instances of this. The reader will see that it is 
appropriate to specialize the arbitrary metric spaces of theorem 5.21 
to the space R}. 

We consider sequences of real functions integrable over a finite 
interval. A pointwise convergent sequence of this kind need not have 
an integrable limit function; and, even if the limit function has an 
integral, the sequence of integrals need not converge to it. 

To justify the first statement we take a sequence of functions f, on 
[0, 1] defined as follows. If (7,,) is a sequence consisting of all the 
rational numbers in [0, 1], let 


lo Ss = ee 


7, n(X) = | 
Then f,, >/, where 


QO otherwise. 


FO [ 1 if x is rational, 
X) = 

| 0 if x is irrational. 
Now each function f, is integrable over [0, 1], since every lower sum 
s(Y) is 0 and the upper sums S(Y) tend to 0 as the length of the 
largest subinterval of D tends to 0 (see C1, 123). However / is not 
integrable over [0, 1] (see C1, 122). 
For the proof of the second statement take the sequence of 
functions g,, on [0, 1] where 


g(x) = nxe-™. 


2, > Z pointwise, where g(x) = Ofor0 < x < 1. Also 
1 
|, gu@)de = 40 -e*) +4, 
: | 


1 
while | g(x)dx = 0. 
0 
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It is left to the reader to show that neither of the sequences (f,,), 
(g,,) converges uniformly (cf. exercises 5(a), 4, 5). 


Theorem 5.22. Let (f,,.) be a sequence of real functions integrable over 
the finite interval [a, b]. If f,, > f uniformly on |[a, b\, then 
(i) f is integrable over [a, b], and 


b b 
di) [>] sr 
Proof. Given € > 0, let N be such that 
SUP, fx) -f(x)| < e for no QN. 
(i) Fora<x< BJ, | 
Iy(x)—€ < f(x) < f(x) + 
and so, when @ is any dissection of [a, 5], 
s(Z, fy) -eb—-a) < SZ,f) < SZ.) < SG, fy) +e(b—a). 
Thus S(2,f) -(ZB.S) < SQ, fy) —0(Q, fy) + 2€(b — a). 
But, since fy is integrable over [a, 5], | 
S(Q, fx) -(F,fn) < € 

if the length of the largest subinterval of @ is sufficiently small. For 
ceeeries. S(D,f)-(Q,f) < {1 +2%b—a)] 


and therefore fis integrable over [a, 5]. 
(ii) For n 2 N, by Cl, 1276), 


\Pn-PA =|P Ga] <-a. | 


Corollary. (Term-by-term integration.) If Xu, converges uniformly 
to s on [a, b] and if each u,, is integrable over [a, b\, then s is integrable 


and _ : 
[s-={ i. 


We note that uniform convergence is not necessary for the validity 
of term-by-term integration; this is shown by illustration (ii) of §5.1. 

Theorem 5.22, proved for a finite interval, is false for an infinite 
interval. (See exercise 5(b), 5.) 

In the last two theorems of this section we deal with sequences of 
functions differentiable in an open or a closed interval. When the 
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interval is closed, derivatives at the end points are taken to be one- 
sided: if the interval is [a, 5], f’(a) and f’(d) are defined by 


Pd) = tim LEADLO, pQy = tim O=LOXM. 


Theorem 5.23 A. Let (f,,) be a sequence of real venkene differentiable 
on a finite closed interval [a, b|. Suppose that (f,,(x)) converges for at 
least one x & [a, b] and that (f,,) converges uniformly on [a, b|. Then 

(i) (f,,) converges uniformly on {a, 6}, 

(ii) f = lim f, is differentiable in [a, b| and, for all x € [a, 5], 


f(x) = lim f,@). 
Proof. Let lim f, = g and take any e > 0. Since (/,) converges 
uniformly on [a, 5], there is an N such that 
sup |[f.(x)-2g(x)| <e¢ for n2>N (5.21) 
axgxr<b 


and therefore 
sup | fn(x) —fin(x)| < 2e for m,n > N. 


If then £,, £ are any points of [a, b], we have by the mean value 
theorem for f,, —/fns 


{fn(E1) —Sn(ED} — n(E2) —Sim(E2)} = (E1— $2) Ufn(S) —Sm(E)}» 


where & lies between £, and &,; and so, for all m,n > N, 


Fal) —Sm(ED} — Uno) —fmlEn)}] < 2[1—Esle. (5.22) 
(i) If x) is a point for which (f,,(xo)) converges, there is an NV’ such 
a f(x) —felxo)] <€ for mn > N’. / (5.23) 


Now let x be any point of [a, b]. By (5.22) with &, = x and & = x) 
and by (5.23) we have, for m,n > max (N, N’), 
[Fn(x) —Sn()| < (L+2|x—x))e < {1+26-aje (5.24) 


and so lim f,,(x) exists for all x €[a, b]. Letting m — oo in (5.24) we 
see that (/,,) converges uniformly on [a, 5]. 

(ii) Let c be an interior point of [a, b] and take any h + 0 such 
that c+he [a, b]. By (5.22), with £; = ct+hand & = c we have, for 


mone N : 
RSS TL CPD) AT AC) fhe Fee) 
ae a a = 26. 25) 
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Since Srl +h) —fmlo) > fle +h) fo) 
as m — 00, (5.25) shows that, for all +0 such that c+he [a, 6], 


Sicsh ie Berk So <2e for n>N. (5.26) 


By the definition of the derivative f/\(c), there is an 7 > 0 such that 


fileB fel —fr(c)| <¢ when 0 < [hl <7. (5.27) 


Then, by (5.26), (5.27) and (5.21), 
flexD FO _ a) 


< ‘ (c+ : LO) _frle —: Sul) 


Fn(e+ 2 —fn(C) = {pe) 


+| fx) -g(0)| 


< 4e 
when 0 < |h| < 7. It follows that /’(c) exists and is equal to 
g(c) = lim f,(c). 


If c is one of a, b, the above proof applies except that 4 must be 
restricted to either positive or negative values respectively. | 


In dealing with open intervals we can assume rather less than 
uniform convergence on the whole interval. 


Theorem 5.23B. Let (f,) be a sequence of real functions differentiable 
in a finite or infinite open interval (a, b). Suppose that (f,(x)) converges 
for at least one x € (a, b) and that (f,) converges uniformly on every 
finite closed subinterval of (a, b). Then 

(i) (f,,) converges uniformly on every finite closed subinterval of 
(a, 5), 

(ii) f = lim f, is differentiable in (a, b) and, for all x € (a, 5), 


f(x) = lim fi@). 
Proof. Let x, be a point such that (f,(%»)) converges, let c be any 


point of (a, b) and let [«, £] be any finite, closed subinterval of (a, 5). 
There is now a finite, closed subinterval [p, q] of (a, 5) such that 


Xo cE(p,g) and [a, £] < [p,q]. 


114 UNIFORM CONVERGENCE (5.2 


Then, by theorem 5.23 A, (f,,) converges uniformly on[p, g] and hence 
on [a, #]. Also, if f= lim/,, then /’(c) exists and is equal to 
lim fi(C). | 


Notes. (1) We can rewrite 5.23A and B as theorems on the term- 
by-term differentiation of infinite series. 

(2) The uniformity of convergence of (f;) is not a necessary 
condition that /’(x) = lim /,(x). A counter-example is provided by 
exercise 5(b), 7. 

(3) The proof of theorem 5.23A is hard. If, however, we add to 
the hypothesis the requirement that each f;, is continuous, the 
theorem is easily deduced from theorem 5.22 as follows: 

By theorem 5.21, f;, tends to a continuous limit function, g say. 

By theorem 5.22, 

[-g = lim |p, 


a nNn—-> oo 


lim {f,2)-fyla)} 
fe) -$@) 


Since g is continuous, the left-hand side has derivative g(x). Hence 
g(x) = f(x), as proved in theorem 5.23A. 

(4) Each of theorems 5.22 and 5.23 exhibits uniformity of con- 
vergence as a sufficient condition for inverting two passages to a 
limit. Differentiation and integration are both limiting operations 


b 
(the integral | f being the limit of approximative sums—C1, 123). 


The conclusions of the theorems are respectively 
‘ee «ft 
|, im 4) = Jim ff, 
and = lim f,(x) = lim = FAX) 
dx n—>0 s = n—->0 rs alae 
or, expressed as term-by-term differentiation of series, 


d — 
FD tal) = Eyl), 
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Exercises 5(d) 
1. Show that the series x x"(1—x”) converges pointwise, but not uniformly 
on the interval [0, 1]. 
2. The function f, on R! is given by 
fi{x) = 


x” 
1+ x2" 


Show that (7) converges uniformly on the interval [a, 5] if and only if [a, b] does 
not contain either of the points 1,—1. 


3. The real function ¢ on [0, 1] is continuous and f, on [0, 1] is defined by 
Sil(x) = x"O(x). 
Prove that (/,) converges uniformly on [0, 1] if and only if (1) = 0. 


4. Give examples to show that a sequence of integrable functions with an 
integrable limit function may be such that the sequence of integrals (a) diverges 
to o, (b) oscillates finitely. 


5. (i) Construct a sequence of functions f, converging uniformly to a function f 
on an interval [a, 0] such that [2 fF, exists for each n, but a f does not. 
(ii) Construct a sequence of functions g, converging uniformly to a function g 


on an interval [a, 00) such that a ety ae TD: |.) aed 1 g exist, but 
a a 


[ | & 
a a 


6. The functions f, on [0, 1] are given by 


Fr(x) = a oe ee 


Find for what values of p the sequence (f,,) converges uniformly to its limit f- 
1 1 
Examine whether | i ~| f for p = 2: and for p = 4, 
0 0 


7. The functions f, on [0, 1] are given by 
nx? 
In{x) = 1+n3x°" 
Show that the sequence (/,) does not satisfy all the conditions of theorem 
5.23 A, but that the derivative of the limit function exists in [0, 1] and is equal to 
the limit of the derivatives. 


8. The functions f, on [—1, 1] are defined by 


fr(x) = 


1+ a 


Show that (f,) converges uniformly and that the limit function fis differentiable, 
but that the relation f’(x) = lim f(x) does not hold for all x in [—1, 1]. 
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9. The functions f, on [0, 2] are given by 
fie = G45". 


Prove that the sequence of functions /,, differentiable on [0, 2], converges 
uniformly to a limit function which is not differentiable at the point 1. 


5.3. Criteria for uniform convergence 


Uniform convergence of a sequence or series of functions can 
often be established by an extension of a criterion for the convergence 
of a sequence or series of numbers. 


Theorem 5.31. (General principle of uniform convergence.) A necessary 
and sufficient condition for the sequence (f,,) of real or complex valued 
functions on the set X to be uniformly convergent is that, given é > 0, 
there is an Ng such that 


sup | fin(x) —f,(x)| < € whenever m,n > nN. (5.31) 
TEx 


Proof. 
(i) If (f,,) converges uniformly to fon X, then, given e > 0, there 
is an mM) such that 


sup | f,(x)-f(x)| < 4¢€ when n> nm 
reX 


and (5.31) is an immediate consequence of this. 

(ii) Suppose that, for every e > 0, there is an m) such that (5.31) 
holds. Then, for every x, the sequence (/,(x)) of real or complex 
numbers is a Cauchy sequence and so, since R* and Z are complete, 
lim f,,(x) exists. Thus there is a function fon X such that 


F(x) >f(x) for every xe X. 


We shall show that f, > f uniformly. 
Take any € > O and let nm be such that (5.31) holds. Now let x 
be any point of XY. Then, when m,n 2 ny, 


| Fin(X) —f,(x)| = 
M@)—f@)| = lim |fn®)—Fi@)| < € for n> no 
Since this holds for all x € X, 


sup |f(x)-f,(x)| <¢ for n> nN. 
reX 


and so 


Therefore f,, >f uniformly on X. | 
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It will be noticed that the proof of the sufficiency is merely part 
of the proof of theorem 3.44 on the completeness of the space 
mx, 1). 
We shall now use theorem 5.31 to establish a number of con- 
ditions ensuring the uniform convergence of series of functions. 


Theorem 5.32. (Weierstrass’s M-test.) Let the: functions uy, Us, ... On 
X be real or complex valued. If, for each n, there is a constant M,, such 
oi lun(x)| <M, (xe X), 
where =M,, converges, then Xu,, converges (absolutely and) uniformly 
on X. 
Proof. Let s, = Uj+...+u,(m = 1, 2, ...). Since 2M, converges, 
given € > 0, there is an m such that 
M,sit---+M, < € whenever m>n2 nN. 

Hence, form > n = MN, 

sup oii—s{x). < Mot. tM, < € 

LE 


and so, by theorem 5.31, Zw, converges uniformly on X. 


The M-test can serve only for absolutely convergent series. 
Theorems 4.53 and 4.54, however, have useful analogues for series 
which are uniformly but not absolutely convergent. 


Theorem 5.33. (Dirichlet.) The functions a, Vv, on X are real valued 
and satisfy the following conditions: 


(i) the sequence s,(x) = > a(x) is uniformly bounded on X, (i.e. 
r=1 


there is a constant H such that |s,(x)| < H for alln and all x € X); 

(ii) for each x, (v,(x)) is @ monotonic sequence ; 

(iii) v,,(x) +0 as n > 0, uniformly on X. 
Then Xa,V», converges uniformly on X. 

Proof. By (i), for k > 0 and all xe X, 

(CANE 9 bee “" + An+4(X)| = Sti) —5S,(x)| < 2H. 

Let ¥ = X,U X,, where, as n increases, v,(x) decreases for xe X, 
and increases for x € X,. Then, by the corollary to theorem 4.52, 
whenever p 2 l, 


ln s1(X) Un ga(X) + .-- + Ansp(*) Unip(X)| < 2H vp +1(%) 
for all xe X;. Since v,(x) >0 uniformly on Xj, it follows from 
theorem 5.31 that the series La,v, converges uniformly on ;. 
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Similarly the series converges uniformly on X, and therefore on 
X= Xi U Xy (exercise: 5(a), 1)2 | 


Theorem 5.34, (Abel.) The functions a,, v, on X are real valued and 
satisfy the following conditions: 
(i) the series Xa,(x) converges uniformly on X; 
(ii) for each x, (v,(x)) is a monotonic sequence; 
(iii) v,(x) is uniformly bounded on X. 
Then Xa,v, converges uniformly on X. 

Proof. Let M be such that |v,(x)| < M for all xe X and all n. 
Again denote by X, and X, the subsets of X on which u,,(x) decreases 
and increases respectively. We may take v,(x) > 0, since we could 
consider M+v,(x). By (i), given ¢ > 0, there is an my) such that 


|an41(X) + ..e FAns(X)| < € 
for n > m, k > O and all xe X. Then, again by the corollary to 
theorem 4.52, 


|an+1(X) Un41(X) Fis TF An+(X) Un+p(X)| - Un 41(X) S eM 
for n > M, p > Oand all x € X,. Hence by theorem 5.31, Xa,,v,, con- 
verges uniformly on X,. Uniform convergence on X, is obtained 
similarly. | 


The above proof is independent of theorem 5.33, for the type of 
argument used to deduce theorem 4.54 from theorem 4.53 is now 
inapplicable. (Why ?) 

A particularly simple and important case of theorem 5.34 occurs 
when a,(x) = a, (independent of x) and Xa, converges. 

Example. The series 5 > . 

When p > 1, by the M-test, the series converges uniformly on 
(— 00, 00), 

When 0 < p < 1, the series converges uniformly in every interval 
[a, 2] where 2k < a < £ < 2(k+1)7, but not in any interval con- 
taining a point 2km (k = 0, +1, +2,...). We prove this. 


> sin rx is uniformly bounded on [«, 4] for, when « < x < £, 


S ence (n+4)x—-cos 4x - | 
vat . 2sin 4x |sin 4x| 
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Also 1/n? decreases and tends to 0. Therefore, by theorem 5.33, 
X(sin nx)/n? converges uniformly on [a, /]. 

The convergence is not uniform in any interval including a point 
2km. For, when x = 2km7+7/(4n), 


a 2 ees Bee 1 1 
en pr? > thes 


2n sin rx : 
>: 


r=nt+1 7 Z 
(since sin @ > 20/7 when 0 < @ < 4m) and the necessary condition 
of theorem 5.31 is not satisfied. This is a recognized technique for 
establishing non-uniformity. 

The last theorem of the section is a partial converse of theorem 5.21. 


Theorem 5.35. (Dini.) Let (f,,) be a convergent sequence of real valued 

functions on a compact metric space (X, p). If, for every x € X, the 
sequence (f,(x)) is monotonic, and if all the functions f,, (n = 1, 2, .-.) 
and f = lim f,, are continuous on X, then f,, > f uniformly on X. 


Proof. Take any ¢ > 0. Given any point c € X, there is an integer 


Ne such that fa) —f(c)| < €. 
Since f,,, —f is continuous, there is an open ball B, with centre c such 
eet nd) -F 09) < 2€ 


for all x € B,. Also | f,(x) —f(x)| decreases and so 
fr) -f(X)| < 2e 
forn >n,and xe B,. 


As X is compact, by the Heine-Borel theorem 3.7, there is a 
finite number of points c,, ..., c, such that the open sets B,,, ..., Be, 


cover X. Hence, if N = max (Nes erg Node 


then fn(x) —f(%)| < 2€ 


for alln > NandxeX. | 
By taking X = [0,1) and f,(x) = x” we see that without the 
condition of compactness the conclusion of the theorem may be false. 


Exercises 5(c) 


1. Examine for convergence and uniformity of convergence the (real) series 
whose nth terms are 
(i) (ii) cosech nx, (iii) Ee 
(iv) e™ sinnx, (v) (/(nt+1)—~n) cos nx, (vi) ot 2 ; 
n® log (n+1) 


Sees 
nl +nx?)’ 
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2. Show that the complex series 
e 1 
converges uniformly on any bounded subset of 
Z* =Z-—{.;., —2i, —i, 0, i, 2i, ...}, 
but not on Z*. 


: oS aX 
3. Show that the series le ee 


— a 
converges uniformly on the interval [— 1, 1]. 


4. The sequence (a,,) of real numbers is monotonic and a, > 0 as n > oo. Prove 
that the power series = a,z" converges uniformly on B(O; 1)—BU;¢) (> 0). 


5. The functions fon (— 0, ©) and g on (1, «) are defined by 
1 ae 
,wtn 3x 2° . Pe n= 


f(x) = x 


Prove that f’ and g’ exist on (— ©, ©) and (1, ©) respectively, and are obtained 
by term-by-term differentiation. 


6. Let « > 0 and define uw, on [0, «) by 


a 

al). 5 1+ 72x?" 

(i) Show that Xu, converges uniformly on [0, 1] . and only if « > 1. [To 
prove non-uniform convergence for « < 1 consider Ds u,(1/n).) 

(ii) Show that =u, converges uniformly on [0, ~) if oa only if1 <a < 2. 

x"(1-—x) 

pee Fs M,. = = SBD. e-—, 
| jones log (n+ 1) 


Prove that =x"(1—.x)/log (n+1) converges uniformly on [0, 1], but that 2M, 
diverges. 


8. Find a series Xu, which converges uniformly and absolutely on an interval 
[a, b] and is such that =|u,| does not converge uniformly on [a, 5]. 


ts 
get ars 


9. (i) Prove that the series 


converges uniformly on [0, ©), but that it does not converge absolutely for any x. 
(ii) Let (r,) be a sequence consisting of all the rational numbers and define the 
functions ff, on (— ©, 00) by 


= 


1. 42pf + = 7; 

0 otherwise. 

= 1 

2 TnI —ZOD) 

converges (absolutely) for every x, but does not converge uniformly in any 
interval. 


Prove that the series 
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10. Denote by [a] the integral part of the real number a. 
Show that the series © [nx] 


oi 


converges uniformly on any finite interval. Prove that the sum function s is 
continuous at any irrational point; and that, at a rational point p/q (where p, q 
are coprime integers and g > 0), 


11. Prove that, for 0 < x < 1, the series 
BY sin (7x") 
n=1" 


converges to a function f and that the convergence is uniform in every interval 

[0, a] (0 < a < 1). 
Show that the series es 

>. — sin (17x") dx 

n=1 n 


1 ‘a. 
converges and deduce that this sum is equal to ; f(x) dx (= lim : f(x) dx) : 
0 a—1l-— 


12. Tannery’s theorem. The complex valued functions v9, U1, Va, «.. defined on the 
set {1, 2, 3, ...} satisfy the following conditions: 


(i) for fixed r, lim v,(”) = w,, 
nD 
Gi) sup |v,(1)| = M, and = M, converges. 
n21 


For n = 1, 2, ..., let 
F(n) = v,(n) + 01(1) +... + Van), 


where p is an increasing function of n and p(n) > 0 as n > ©. Prove that 
ive] 
lim F(n) = > ,. 
m—> oo r=0 

[Define the functions uo, uy, Uz, ... on the set {0, 1, 4, 4, ...} endowed with the 
usual metric on R' by 

(*) v,(n) if p(n) 2 r, 
Uy = 


ce ; u,(O) = Wr 
n 0 if p(n) <7; 


and use the corollary to theorem 5.21.] 


13. Deduce from 12 that, for every complex number z, 


lim (1+7) = exp Z, 


na 


co 
where exp z is defined as 5) 2"/n!. 
n=0 
14. Sequences of monotonic functions. Each function f, is monotonic on the finite, 
closed interval [a, b] and the sequence (/,) converges pointwise on [a, b] to a 
continuous function f£. Prove that the convergence is uniform. 
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5.4. Further properties of power series 


With the concept of uniformity at our command we can prove 
theorems deeper than those of §4.6. 


Theorem 5.41. If the power series Xa,,z" has radius of convergence R 
and) < r < R, then Xa,,z” converges uniformly for |z| < r. 

Proof. When |z| <r, |a,z"| < |a,|r” and XJa,|r" converges by 
theorem 4.61. The result now follows, from the M-test. | 


Theorem 5.42. The sum of a power series is continuous within its 
circle of convergence. 


Proof. Let R be the radius of convergence of Xa,,z” and let z, be 
any point such that |z| < R. If |z| <r < R, Xa,z” converges 
uniformly for |z| <r and so, by the corollary to theorem 5.21, 
La,z” is continuous at Zp. | 


Theorem 5.43. (Principle of equating coefficients.) Jf Xa,,z” = Xb, z” 
for a sequence of non-zero values of z tending to 0 as limit, then a, = b, 
Sor all n. 


Proof. Let (z;,) be a sequence of non-zero complex numbers such 
that Xa,z" = Xb, 2" for z = z,, where lim z, = 0. 

Suppose that the theorem is false and let a,,, b,, be the first pair of 
unequal coefficients. Then 


z™ : nin —Omin)2” = 0 
0 


forall z, and so » ab, 2 = 0 
n=0 


for all z,. The power series clearly has a positive radius of converg- 
ence. It is therefore continuous at z = 0 and so its sum for z = Ois 0. 
Thus a, —5,, = 0 and we have a contradiction. | 


It was proved in C1 (p. 101) that 
& a, 2") ( > b, 2") i: SjIGh + eee 
n=0 n=0 n=0 


when z lies in the discs of convergence of both Xa,,z” and D4, z”; in 
other words, the product of two power series is obtained by formal 
multiplication. The next theorem shows that formal manipulation 
also leads to the power series of composite functions. 
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Theorem 5.44. Let 


f@) = % az" ([2| < RB), 


nN 


a) = % baz” (lel < 5). 
If |by| < R, there isap > 0 such that | 
f@@) = E erz* (lz <P), 


where the coefficients c,, are obtained by formal substitution: defining 
Pun (k,n = 0, 1, 2, ...) by 


co 
g*(z) -— 2, Pat's 
n=0 
cO 
we have c= 2; Gc Paw 
k=0 


Proof. 
(i) Suppose that ¢ is such that x |b,6"| < R. Then |g(Q| < R 


and therefore 
Ae) = ¥ ag = 3 ae(E rant) = = (= a Pent) 
k=0 k=0 n=0 k=0 \n=0 


If the order of the two summations on the right may be reversed, we 
have 


Ae) = 3 (E aeping®) = E ent 
n=0 \k=0 n=0 
By theorem 4.74, this reversal is legitimate if 


p> ( p> ae Pens") 
k=0 \n=0 
converges. 
The power series & |b,,|z" has the same radius of convergence S as 
oe 2". Let a 
h(z) = a, [b,,|z” (|z| < S). 


Then Mie S a,c, 
n=0 
where 
Qin = »> tone ia! 2 | p> . see bn, | = | Pen 


M+... +M=Nn M+...+¢Mm=Nn 
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Now & |a,|z* has radius of convergence R, 
E alg = Me) and Wie) = ¥ lose < R. | 
Therefore bs |a;,| (= dnl] 
converges. Since |Pynf”"| < un|S"|, it follows that 


ai (5m) = 5, (5, turn 


=0 


oc 
>> 
k=0 
converges. 


(ii) By theorem 5.42, A is continuous in the disc |z| < S. Since 
h(0) = |b)| < R, there is therefore ap > 0 such that 


E [bnz™| = A(lz)) < R 
for all z in the disc |z| < p. | 


Notes. (1) If 5, = 0, then 
Pin = oe Tn ‘a by ==" () 
Nyt eo +NE=N 
fork > nand so Ct Fa De 
k=0 


is the sum of a finite series. 

(2) If R = oo, then the inequality = |b,z"| < R is satisfied by all 
z such that |z| < S; thus p = S. 

We now use theorem 5.44 to obtain the reciprocal of a power series. 


Theorem 5.45. Let 
$e) = E az" (\2| < S). 


If a) + 0, there isa p > 0 such that 


1 fo) 
Soe me 2 Pye” Cz: <p), 
BG) = dy nz” (lel < 0) 
where the coefficients £,, are obtained from the equations 


My Bo sie l, 
Oty By + Hy Bo = 0, 
hy Bo +f, +%fy = 0, 


eeoereeeeeoeeaee eee ee ee o 
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Proof. In theorem 5.44 take 


fey == EST (el < lad 


Oy tz 
and e(z) = 2 Yel —'S). 


Then S(g(z)) = 1/P(2) 


and 1/A(z) may be expanded in a power series 2f,,z” for all z such 
that = 
x |, 2”| ies |%|. 


By continuity, this inequality holds in a disc |z| < p. 

The formulae of theorem 5.44 give explicit but cumbersome 
expressions for the f,. It is simpler to evaluate Ba Ps Pe xy: SOCEES- 
sively. When |z| < p, 


(5, a2") (5, pa2") = $2). =4 


and so, by theorem 5.43, 
dy hg = 1, , fe. 0° (= 1, 2,2). | 


In the rest of this section we deal with real power series only. 


Theorem 5.46. Suppose that the real power series 2a,,x" converges to 
f(x) in its interval of convergence (—R, R). Then 
(i) f is differentiable in (— R, R) and 
f(x) = X na,x™™; 
n=1 


(ii) for |x| < R, 


Proof. 

(i) By theorem 4.62, Zna, x"— has radius of convergence R and, by 
theorem 5.41, the series converges uniformly on any (finite) closed 
subinterval of (— R, R). Theorem 5.23B now gives the result. 

(ii) If x is any point of (—R, R), La,x" converges uniformly on 
[0, x] (or [x, 0] if x < 0). Now apply the corollary to theorem 5.22. | 
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Theorem 5.47. (Abel’s continuity theorem.) Jf Xa, converges, then 


Dy anxX" > 3a, .aS> x>1l-. 


n=0 n=0 


Proof. By theorem 5.34, Xa, x” converges uniformly, say to f(x), 
for0 < x < 1. Then by the corollary to theorem 5.21, fis continuous 
on [0, 1]. In particular, f(x) >f(1) asx >1-. | 


Example. 1-4+4—4+... = 42. 


If |x| <1, 


Hence, by theorem 5.46 (ii), 


arc tan x = > et gett 


for |x| < 1. Since = (—1)"/(2n+1) converges, 


. (oe) (- 1)” 
] t = ; 
: lio arc tan x 2 ae 


The left-hand side is arc tan 1 = 47. 


Exercises 5(d) 
1. Evaluate the following complex limits: 


() tiny Say ea oe el 
29 2 ea 20 (e7— 1)? 
2. Find the power series expansions up to the term in z‘ of the following functions: 
(i) cos (sinz), (ii) e¥@+#, ii) sin (cos z). 
3. The function fon R? is defined by 


oe ee ror x 0, 


for x = 0. 


Show that, at 0, fhas derivatives of all orders. Evaluate f’"(0). 
4. Given the series 5) a, and > b,, let 
0 0 


Cn, = Agbat... +n bo (n = 0). 


(i) Prove that, if Xa,, &b,, Uc, converge to sums A, B, C respectively, then 
AB = C. 
(ii) Construct convergent series Xa,, Ub, for which <c,, diverges. 
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S. Let do, @). do, ... de real. 
(i) Show that, if Xa, converges, then La,/(n+ 1) converges and 


| ee) oe) a 
a,x") dx = — , 
I (2, a n+1 
(ii) Show that, if =a,/(n+1) converges, then the last equation holds, though 
the integrand may be unbounded. 


6. Prove that 


1 
,-1,! 1 


“319 BT = 


a 1 +2 log (1+2)}. 


1 
42 


7. Prove that, if —1 < x < 1, 


(ee) aire | n 
fog(i+x}?2=2>3 (oT Set a 
n=2 n 
4 1 
where i= I+stgt..t7. 


Is the result valid for x = 1? 


8. Assume that, when a is any real number, the binomial expansion 


(+x)*= 2 (") x 
n=0 \t 
holds for —1 < x < 1 (see Cl, 102). 
Show that, when « > 0,> (*) x" converges uniformly to (1+.x)* for 


—1 <x < 1; and, when a > —1, the series converges uniformly to (1 + x)% for 
—1+éd <x <1(6> 0). See exercise 4(/), 2.) 


9. Prove that, for |x| < 1, 


wee Te Free 
log (xt V(l+x9} = x3 +5 ae UE 


10. Prove that, for all real a and /, 
Ss 4%\/ 2 a+p 
ay) €?)- 


5.5. Two constructions of continuous functions 


Each of the two theorems in this section carries out the construction 
of a continuous function which is to have an assigned property. The 
method is to build up a uniformly convergent series of approximating 
functions designed to induce the required property in the limit 
function. 

A continuous, nowhere differentiable function. It is easy to construct 
a continuous function which fails to have a derivative at the points 
of a finite or a countably infinite set. The extension to an uncountable 
set is more difficult. 
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Lemma, Let f be a real function bounded on (— ©, ©). If 
LO —F (x) 
c—x 
is unbounded, then f’(c) does not exist. 
Proof. Let |f(x)| < M for all x. Suppose that f’(c) does exist. 
Then there is a d > 0 such that 


OI) -£'0| <4 


for 0 < |c—x| < 6, so that 


Fo) -FO) 
c=* 


< if'(o| +1 for 0<|c—x| < 4; 
fO-f)| . 2M 


for |c—x| 20 
c—x 


and 


The last two inequalities contradict the unboundedness of 
{f(o) -F@)}(e- x). | 
Theorem 5.51. There exists a real function which is continuous on 
(— 00, 0), but nowhere differentiable. 
Proof. Let g be the function on (— ©, 00) defined by 
g(x) = x for [x <.2 
g(xt+4n) = g(x) (w= +1, +2,...). 

Then (i) g is continuous, (ii) 0 < g(x) < 2 for all x, (aii) for all x, y, 

eae? <1 

aa 

(iv) given c, there is an x, such that 


g(c) —8(%)| _ | 


C— Xo 
When n = 0, 1, 2, ..., define the functions g,, by 
Sn(x) = ag(b"x), 


ee a,b are positive constants. Then (i)’ g, is continuous, (ii)’ 
< g,(x) < 2a” for all x, (aii)’ for all x, y, 


£,(X) —Zn(y) < gbn 
x—-y = : 


|c—x)| = 1 and 
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(iv)’ given c, there is an x, such that 


& n(C) me n(Xn) 
c~- 2, 


lc—x,| = b-™ and = ab”, 


if 0 <a < 1, Ze) converges uniformly on (— ©, ©) (by the 
M-test) and so a 
Ss = 2 En 


is continuous everywhere (by theorem 5.21). However we shall show 
that, if a is sufficiently small and 5 is sufficiently large, fis nowhere 


differentiable. 
Taking ab > 1 we have 
LO-IEn)) _ | F&O — §i(%n) 
C—-Xn i=0 C= Xn 
8n(C) — 8n(Xn) ae (‘= = 8i(C) —8:(Xn) 
‘ C—Xn Xt, C—Xn 
n—1 roe) 4 
> a™b"— > abt — > = 
i=0 i=n+1 
= el gil. 2a 
— ab—1 l—a 
1 pi: ee aren 
. ( 1 -) . 
gs l 2a 
Hence, if also 7 > 0, 


the lemma shows that /’(c) does not exist. This inequality holds if a 
is small and b large enough, for instance if a = 4, b = 20. Thus the 


function given by o | 


is everywhere continuous and nowhere differentiable. | 


The non-differentiability of f is not dependent on the lack of a 
derivative of g, at the points 2k/b”. It is possible to start with a 
differentiable function g; in fact, the first continuous, nowhere 
differentiable function, constructed by Weierstrass, was 


a” cos b"nx (0 <a < 1,ab > 1+37). 
n=0 
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Tietze’s extension theorem. If a real function f is continuous on 
[a, b], it is easy to define a function f* continuous on R! such that 
f*(x) = f(x) for a < x < b. Tietze proved a theorem extending the 
domain of definition of a given function continuous on a closed set 
in an arbitrary metric space. 


Lemma. Suppose that (X, p) is a metric space, g is a real valued 
function continuous and bounded on a closed set F in X and 


sup |g@)| = M. 


Then there is a real valued continuous function h on X such that 
(i) |A~x)| < 4M for xe X, and 
(ii) |g(x)—A(x)| < 4M forxeF. 
Proof. Let A= {xeF|-—M < g(x) < -4M}, 
B = {xe F|-4M < g(x) < 4M}, 
C = {xe Fl4M < g(x) < M}. 


By theorem 3.23, A and C are closed and so, by exercise 3(f), 12, the 
function h on X given by 


re p(x, A) — p(x, C) 
me) = Mina, A+ poEc) 


is continuous. Clearly (i) holds. To establish (ii) we compare the 
values of g and / on the sets A, B, C. 

On A, -—-M< g(x) < -4M and A(x) = —4M. 

On B, lg(x)| < 4M and |h()| < 4m. 

On C, 4M < g(x) < M and A(x) = 4M. | 


Theorem 5.52. (Tietze’s extension theorem.) Let (X, p) be a metric 
space and let f be a real valued function continuous and bounded on a 
closed set F in X. Then there is a real valued continuous function f* 
on X such that 

(i) f*(x) = f(x) for x € F, and 

(ui) the bounds of f* on X are the same as the bounds of f on F. 


Proof. Since we can add a constant function to f, we may suppose 
that the range of fis symmetrical about 0; say 


inf f(x) = —K, sup/f(x) = K. 
rer xref 
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Define the sequence (/,,) of real valued, continuous functions on X 
as follows. Let f, be the function A of the lemma corresponding to 
g = f. When f,, ..., f,-1 have been defined, let f,, be the function h 
of the lemma corresponding to g = f—(fi+...+/n-1). Induction 
shows that, for all n, 


\f(x)| < 3G)""*K (xe X), (5.51) 
If) -TAG) +... +f] < @"K we F). (5.52) 


By (5.51) and the M-test, Xf, converges uniformly on X to a 
function f*. Theorem 5.21 shows that f* is continuous on X and 
(5.52) shows that f*(x) = f(x) for x e F. Again, by (5.51), 


If*@)| < 4K E @=K 
for all x e X. Hence 


K = sup f*(x) < sup/*(x) < K 
acer reX 


and so sup f*(x) = K. Similarly inf f*(x) = —K. | 
rex rex 


Corollary. (Urysohn’s lemma.) Let (X, p) be a metric space and let 
A, B be disjoint, closed subsets of X. Then there is a continuous function 
f: XR such that 0 < f(x) < 1 on X, f(x) = 0 for xEA and 
I(x) = 1 for:x€ B. 

This result shows that, given an arbitrary metric space, there is a 
plentiful supply of real valued functions continuous on it. 


Exercises 5(e) 


1. Construct a function on an interval [a,b] which is everywhere n times different- 
iable, but nowhere has an (m+ 1)th derivative. 


2. Let (r,) be a sequence consisting of all the rational numbers in (0, 1) and 
define the function f on [0, 1] by 
Se LP, 
fx) = & ee ; | 0 <x <1). 
nm=1 Nn 
Show that fis continuous on [0, 1] and not differentiable at the points ry. 
[Define u, (n = 1, 2, ...) on [0, 1] by 
—1 fordO< 


i forr, < 


< Irn 
<1 


Un(x) = | 


x 
x 


H omy ee) 
and consider { ( un(sd|n*) dx.] 
0 \1 
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3. Give an example to show that the conclusion of theorem 5.52 need not hold 
if the condition that the domain F of fis closed is omitted. 


4. Let F be a compact subset of R*. Prove that, if fis a real valued, continuous 
function on F, then there is a continuous extension f* of f which is zero outside 
a sufficiently large open ball B(O; R) and is such that 


sup |f*(x)| = sup |f()]. 
xeRk xerk 
5. Prove the following extension theorem. Let CX, »), (Y, 7) be metric spaces 


and let E’ be a subset of X. If (Y, a) is complete, then every uniformly con- 
tinuous function f: E > Y has a unique, uniformly continuous extension to E. 


5.6. Weierstrass’s approximation theorem and its generalization 


An important problem of analysis is the approximation of func- 
tions by simpler functions. Weierstrass’s theorem deals with the 
approximation of continuous real functions by polynomials. 


Lemma 1. There is a sequence of polynomials converging uniformly to 
|x| for -l <x <1. 


Proof. If u = 1—.x?, then |x| = ./(1—w) and the interval 0 <u <1 
corresponds to the interval —1 < x <1. By exercise 5(d), 8, the 


binomial series ee 
E(w 
converges uniformly to ,/(1—u) for 0 < u < 1}. Hence the series 
3 (?) (x21) 
n=0 \tl 
converges uniformly to |x| for -1 < x < 1. | 
Corollary. Let the function g be given by 
. ( 0 “for x < 0, 
a lx for x > 0. 
Then, for any interval [a, £], there is a sequence of polynomials con- 
verging uniformly to g(x) fora <x < f. 
Proof. Let k be such that [a, 6] < [—&, k]. Since 
x 


i) 


the result immediately follows from the lemma. | 


Xx 
| a(x) = He (E+ 
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Theorem 5.61. (Weierstrass’s approximation theorem.) If the real 
function f is continuous on the finite closed interval [a, b\, there is a 
sequence of polynomials converging uniformly to f on [a, 5}. 


Proof. It is sufficient to prove that, given ¢ > 0, there is a poly- 
nomial p such that 


Se ae 


There is (see exercise 3(f), 9) a piecewise linear function h such 
that 
f(x) -h@)| < fe for a<x<b. 
Let the vertices of the graph of h be (Xp, Vo). (Xi Vids «++ (ns Vn) 
where x) = a and x, = b. Then, fora < x < B, 


n—1 
h(x) = Yor Y ccg(x—%i); 
where the c’s are defined by the equations 


Yr = Yot Co(X1— 0), 


Vo = Yo Co(Xe— Xo) + Ci(%e — Xy)> 


Yn = Vot Co(Xn es Xo) = C(Xp, ae x4) air ie Cain Be Xn—1) 
By the corollary, there is a polynomial p such that 


|A(x)—p(x)| < te for a<x<b 
and this gives 
If()—-p()| <e for a<x<b. | 


This proof, due to Lebesgue, is one of the simplest of the many 
proofs of Weierstrass’s theorem. Some of these are easily adapted 
to show that the result holds in R* (k > 1). M. H. Stone has put the 
theorem in a more general algebraic setting which exposes the 
essential features of the original Weierstrass theorem. To complete 
this chapter we shall prove Stone’s theorem (5.62), but neither the 
theorem, nor the algebraic setting will be appealed to in the rest of 
the book. 

Let (X, 7) be a metric space and, as usual, let C(X) be the space 
of real valued functions bounded and continuous on X. Algebraic 
operations in C(X) are defined in the obvious way. If f; ge C(X) 
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and a is a real number, the (bounded and continuous) functions 
f+, fg and af are such that, for all x e X 


(f+g)(x) = f(x) +8), 
(fg)(x) = f(x)a(); 
(af) (x) = of (x). 


Definition. The subset A of C(X) is called an algebra if, whenever 
J, g¢€A and « is any real number, f+ g, fg, ofeE A. 

The set of polynomials and the set of differentiable functions on 
[a, b] are clearly algebras in C[a, 5]. 

Using the usual metric p on C(X) we now consider the closure 4 
of an algebra A. A is, in fact, the set of those functions in C(X) 
which are the limits of uniformly convergent sequences of members 
of A. 


Lemma 2. If (X, 7) is any metric space and A is an algebra in C(X), 
then A is also an algebra. 


Proof. Let f, g € A. Then there are sequences (f,,), (g,) of members 
of A which converge uniformly on X to f, g respectively. 

Clearly f, +2, >/+g uniformly on X and, if « is any real number, 
af, > af uniformly on X. Also, since all functions in C(X) are 
bounded, every uniformly convergent sequence is uniformly bounded. 


Theref 

eae Inka Je = J.(8.—2) +8U,—S) ~~ 
uniformly on _X. Thus, since f,+8n, Sn Sn8n€A, it follows that 
+8, of, fg eA. | | 


Weierstrass’s approximation theorem asserts that the algebra A, 
of polynomials on [a, b] is dense in C[a, 5], ic. that A, = C[a, b]. 
We shall see that, more generally, the relation A = C(X) holds if 
(X, 7) is compact and if the algebra A satisfies the following two 
conditions: 

(i) A contains some non-zero constant function and so all constant 
functions. 

(ii) A separates points on X, i.e. to every pair x1, x, of distinct 
points of X there corresponds a function fe A such that f(x,) + f(x). 

The algebra of polynomials on [a, b] satisfies both (i) and (ii); 
the algebra of even polynomials-on [—1, 1] satisfies (i), but not (ii); 
and the algebra of polynomials on [0, 1] with zero constant term 
satisfies (11), but not (i). 
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Theorem 5.62. (The Stone-Weierstrass theorem.) Let (X,7) be a 
compact metric space. If A is an algebra in C(X) which contains the 
constant functions and which separates points on X, then A = C(X). 


Proof. 
I. We first show that, if fe A, then |/| ¢ A (where | f| is defined by 
Fite) =| /Go). “Let sup |f(x)| = # and take any ¢ > 0. By the 


corollary to lemma 1, Beiey is a polynomial p given by 
P(E) = M084... +%,5" 

such that |é|-p(E)| < e for -w< <p. 

As A is an algebra containing the constant functions, 


ce pof = %+a,ft+...+%,f” 
belongs to A. Also 


| F)| -—(P MOD! = [LFC)|—P{FC)}| < € 
for every x € X. Since A is closed in C(X) (by theorem 2.36), it follows 
that | f| € A. 
II. Defining min (f, g) and max (f, g) by 
(min (f, g))(x) = min (f(x), 8), 
(max (f, g))(x) = max (f(»), g(>)) 


min (f,g) = #(f+g-|f-8)), 
max (f, g) = 4(f+g+|f-8)). 


Hence, by I, if f, g ¢ A, then min (f, g), max (f, g) € A. 
III. Let f be any function in C(X). We need only show that 
given € > 0, there is a function g € A such that 


|f(x) -a(x)| < € 
for all xe X. 


Let u, v be two arbitrary distinct points of X. There is a famction 
6¢€A such that ¢(u) + d(v) and, since A contains the constant 
functions, we may assume that ¢(u) = 0 so that d(v) + 0. Also let 
wy be the constant function given by (x) = f(u) for all x € X. 

If « is the real number such that 


ap(v)+f(u) = fe), 
the function Sum = opty 
is such that g,,,(u) = f(u) and g,,(v) = f(v). We note that g,,, € A. 


we have 
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Since g,,, and f are continuous, the set 
K, = {(x|xe X3 Bux) < f(x) +9} 


is Open; moreover it contains both u and v. Hence the collection 
of sets K,, where v ranges over X—{u} is an open covering of X. 
Since X is compact, by the Heine—Borel theorem there is a finite 
number of sets K,,, ..., K,, covering X. Let 


Su = MIN (Suy,s -++> Suv,)- 
By Il, g,,€ A; also g,(u) = f(u) and g,(x) < f(x) +e for all xe X. 
Oe tia eee 
is open. We note that 
F(x)—-€ < g(x) < f(x)+e for xeJ,,. 


The collection {J,},,-x is an open covering of the compact space Y 
and so a finite number of sets J,,,, ..., Jy, covers X. Let 


& = MAX (Gy,5 ---5 Bein) 
Then g € A and f(x)—-€ < g(x) < f(x)+¢ forall xe X. | 


Clearly the multi-dimensional form of Weierstrass’s approxima- 
tion theorem is a particular case of theorem 5.62. A trigonometric 
analogue of Weierstrass’s theorem is given in exercise 5(f), 2. 

There are also complex versions of theorems 5.61 and 5.62. For 
instance every complex function continuous on a compact subset of 
Z is the limit of a uniformly convergent sequence of polynomials in 
z and Z. That polynomials in z alone will not do is a consequence of 
another theorem of Weierstrass. (See exercise 13(a), 6.) 


Exercises 5(f) 
1. The real function fis continuous on [a, 5] and, for n = 0, 1, 2, ..., 


[ : Fx)xtdx = 0: 


b 
Show that : [ Tixvis = 0 
a 


and deduce that f(x) = O fora < x < b. 
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2. A trigonometric polynomial is a real function of the form 


nN 
Ag+ > (a, cos rx+b, sin rx). 
r=1 
Deduce from theorem 5.62 that, if fis continuous on [a, £], where f—a < 2z7, 
then f is the limit of a uniformly convergent sequence of trigonometric poly- 
nomials. Show that this result is false if S—a > 27. ies condition of theorem 
5.62 is then violated ? 


3. Prove that every real function continuous on the interval [a, b] is the limit of a 
uniformly convergent sequence of even polynomials if and only if (a, b) does not 
include the origin. 


4. Alternative proof of Lemma 1. Show that the only solution of the equation 
y = 40°+1-x’) 


which satisfies y < 1 is y = 1— |x|. 
For —1 < x < 1, define the sequence (y,,) of functions by 


y(X) = 1, ra) = HAG)+1-x7} (= 1, 2,...). 
Prove that, for -1 < x < landu=0,1,2,. 
Yn(x) > 1—|x| and oe Yn(X). 
Deduce that y,(x) >1-— |x| as noo, uniformly for —1 < x < 1; and hence 


that there is a sequence of polynomials converging uniformly to |x| for 
—-1<x <1. 


NOTES ON CHAPTER 5 


§5.1. The notion of uniformity of convergence was made explicit by three 
mathematicians independently, Weierstrass, Stokes and Seidel in the late 1840s. 
The idea was rediscovered a few years later by Cauchy. A critical examination of 
the exposition of the early writers was made by Hardy, ‘Sir George Stokes and 
the concept of uniform convergence’, Proc. Cambridge Phil. Soc. 19 (1918), 
148-56. Hardy’s note includes developments of Stokes’s work and answers 
questions which the reader may have asked himself on §5.2. It is noticeable in 
theorem 5.21 that the full strength of the hypothesis of uniformity is not used in 


the proof. The inequality sup o(f,(x), f(x) < € 


is quoted only for a single n, not for allnm > N. This indicates that a condition less 
restrictive than uniformity should suffice for theorem 5.21. Such a wider condi- 
tion, due in substance to Dini and called quasi-uniform convergence, is necessary 
as well as sufficient for the continuity of the limit function of a sequence of 
continuous functions. However it lacks the simplicity possessed by the condition 
of uniformity of convergence and, instead of reproducing it here, we refer the 
interested reader to Hardy’s paper. 


§5.4. N. H. Abel (1802-29) established his continuity theorem with the 
immediate aim of investigating the validity of the binomial expansion at the 
points +1. In the original proof, partial summation (theorem 4.52, corollary) is 
used to show that on the acuta side of the identity 


co 


Sm ->, AnX" = 2 af{l—x)+ D  a(i—x") 
n=0 N+1 
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the second term can be made arbitrarily small for all x in (0,1) by choice of a 
large N; the first term may then be made small by choice of x near 1. 

The relation 1-4+1—... = 47 was first noted by J. Gregory (1638-75) and 
was later obtained independently by G. W. Leibniz (1646-1716). 


§ 5.5. Weierstrass’s construction of a continuous, non-differentiable function 
was published in 1861. Until then arguments based on geometrical intuition 
were still acceptable to some analysts. The fallibility of such reasoning was 
finally demonstrated by this function which so strikingly contradicts the promp- 
tings of the visual imagination. 

The proof of theorem 5.51 is based on a method developed by F. A. Behrend 
(‘Crinkly Curves and Choppy Surfaces’, Amer. Math. Monthly, 67 (1960), 
971-73). : 


§5.6. Stone gave an account of ‘The generalized Weierstrass approximation 
theorem’ in Math. Magazine 21 (1948), 167-84 and 237-54. This paper is not 
readily accessible, but it has been reprinted, with only minor changes, in Studies 
in Modern Analysis (ed. R. C. Buck), 20-87. 


6 
INTEGRATION 


6.1. The Riemann-Stieltjes integral 


Suppose that f is a bounded real function on the finite interval 
[a, b]. If Z is a dissection of [a, b] given by 
i eo. «ee XD, 
let int fixy=m, sup fy) = 
Vi-1S LH} ~~ |S 2X 


Then the sums 


nr 


x m,(x; ae Xi-1)s = M(x; ens Nes) (6.1 1) 


b 
approximate to the Riemann integral | Ff, when it exists (C1, 123-4). 


A far reaching extension of this process is due to Stieltjes who 
introduced a second real function g, assumed to be increasing on 
[a, b] (in the wide sense), and replaced the increments x;—x,;_, in 
(6.11) by g(x;)—g(x;_1). This new procedure leads to an integral of f 
with respect to g. The sums corresponding to (6.11) are 


() = (fg) = % mige)-eeidh (6.12) 


S(P) = SZ, f, 8) = PA M {g(x:) -— 8%}: (6.13) 


they reduce to (6.11) when g(x) = x (a < x < b). Denoting the 
infimum and supremum of f(x) on [a, b] by m and M respectively we 


e 
m{g(b)—g(a)} < (F,f, g) < SG,f,8) < Mig(b)—g(@)}. 
Thus, for all dissections Z, the lower sums (6.12) and the upper sums 
(6.13) are bounded. 

It is easy to see that the introduction of new points of division 
increases the lower sum and diminishes the upper sum. (See exercise 
6(a), 1.) From this it follows that any lower sum is less than or equal 
to any upper sum. For let , and J, be any dissections of [a, 5]. If, 
now, @ is the dissection containing all the points of division of F, 
and &,, then 


(DQ) < (F), SF) < SF), SP) < SCF) 
and so s(F,) < S(F,). (6.14) 
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Definition. Write 


sup (P,f, g) = [7 dg (or | e, (det) 


inf S(Q, f, g) = : fdg (or [709 d(x), 


where the supremum and infimum are taken over all dissections DZ of 
[a, b]. The first expression is called the lower integral of f with respect 
to g over [a, b}, the second the upper integral. 


Note that | fdg and { fdg exist whenever fis bounded on [a, 5] and 


g increases; also, by (6.14), 


[foe Pom 


b Tb 
Detain | fag = | fads, 


} is said to be integrable with respect to g over [a, b] and the common 
value of the upper and lower integrals, denoted by 


[tae (or [’ s004800) 


is called the Riemann-Stieltjes (or RS) integral of f with respect to g. 
The function g is called the integrator, the function f the integrand. 
The class of functions integrable with respect to g over [a, b| is denoted 
by R (g; a, 5). 

It is convenient to complete the definition of the RS integral by 


putting : : 
[; fag = -| fae 


(when the right-hand side exists) and 


| : fdg-=0 
(for all functions f, g). i 
When g(x) = x(a < x < bd), the Riemann-Stieltjes integral re- 
duces to the Riemann integral. The definition of the Riemann 
integral given in C1 (123) is not obviously the same as the one given 
here in terms of upper and lower integrals, but the equivalence of the 
two definitions will be proved in theorem 6.72. Both definitions are 
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useful. The class of functions Riemann integrable over the interval 
[a, b] will be denoted by R(a, b); and upper and lower Riemann sums 


by S(Z,f), (fF). 
Illustrations 


(i) Every constant function k has an RS integral with respect to any increasing 
function g and 


b 
| kdg = kig(b)—g(a)}. 


This follows from the fact that, for all J, (FD) = S(D) = k{g(b)—g(a)}. 
(ii) Let f be the function defined by 
1 if x is rational, 
I(x) = {) if x is irrational. 
Then inf f(x) = 0 and sup f(x) = 1 in every interval. Hence, when g is any 
increasing function, 


b Tb 
{ fde =0 end | fie = ot. 


Thus, if g is not constant, f¢ R (g; a, bd). 


In the rest of this section we establish the elementary properties of 
the Riemann-Stieltjes integral. When defining it we made the assump- 
tion that the integrator is increasing and the integrand is bounded. 
We continue with this pre-supposition without always stating it 
explicity. 


Theorem 6.11. A necessary and sufficient condition that f € R (g; a, b) 
is that, given e > O, there is a dissection D of [a, b\ such that 


S(F,f, 8) -(PF,f; 8) < &. (6.15) 
Proof. 


(i) If fe R(g; a, b), ie. if 


| “fg = [fae 


then, given e > 0, there are dissections Y, and Y, such that 


b b 
S(F,)- | fdge <te and | fdg—s(B,) < te. 
Thus S(F1) -— (Be) < €. 


Now let F be the dissection containing all the points of division of 
9, and of Y,. Then 

S(D,) = SY) = (F) = (2) 
and so (6.15) holds. 
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(ii) Suppose that, for every e > 0, there is a M such that (6.15) 
holds. Then, since 4 = 
b b 
S(2) > | fdg > | fg > 9), 


fo b 
it follows that © o'< | fae-| fdg <€. 
| TS < b 
This holds for all « > 0 and so | fdg = | fae. | 
a 7a 


The next theorem gives the existence of the Riemann-Stieltjes 
integral in two simple but important cases. In its proof, and later, 
we call the length of the largest subinterval of a dissection Z the 
mesh of Z and we denote it by “(Y). 


Theorem 6.12. 

(i) If f is continuous on [a, b], then fe R(g; a, b). 

(ii) If fis monotonic on [a, b] and g is continuous (as well as increas- 
ing), then fe R(g; a, b). 


Proof. 
(i) For any dissection 9 of [a, b] we have, with the usual notation, 


S(@)- 9) = ¥ (Mi—m) {ge -a%-a)} 
<a Mh) as (g(%i)— 8%} 


max (M;—™m,) {g(b) — g(a}. 


1<i<xn 
Since f is uniformly continuous on [a, 5], max (/;,—m,) may be 
made arbitrarily small by taking “(Y) sufficiently small. Theorem 6.11 


then shows that fe R(g; a, b). 
(ii) Suppose that f increases. Then 


S(2)-D) = & V)-SOr-d}Heed - a} 
< max {g(%)—8i-)}H FO) -—/@} > 0 
as (D2) -> 0, by the uniform continuity of g. | 


Part (ii) of the theorem will be much improved in theorem 6.24. 
We now prove that the RS integral is linear in both the integrand 


and the integrator. 
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Theorem 6.13. 
(i) (a) Iffe R(g; a, b), thenkf € R(g; a, b) for every constant k and 


b b 
[ (de =k [Fas 
(b) Iff.,f2 € R(g; a, b), then f,+ fo € R(g; a, 5) and 
b b b 
[ Gras =| fader] fads 
(ii) (a) Iffe R(g; a, b), then fe R (kg; a, b) for every non-negative 


constant k and . : 
[ fates) = k [Fas 


(b) If fe R(g1; a, b) and fe R(g2; a, 6), then 
Je R(g1 +82; 4, 5) 


and [ fawitey = [ fde.+ | fe. 


Proof. 
(i) (a) This follows immediately from exercise 6(a), 2. 
(b) We first note that, since 


inf f, +inf f, < inf (+f). 
S(D,fi, 2) +(D, fo, 2) < (FD, fi tSe g)- (6.16) 


Now, given € > 0, there are J,, Z, such that 
b b 
(Pr,fra) >| fidg—e and %Dyfys)>| fde-e 
and, if D has all the points of division of Z, and of J,, 


b b 
(P,fog)>| fdg-e and fog) >| fde—e. 
Hence, by (6.16), Z : 


b b b 
| fde+| fodg < (DZ, fith, g)t+2e < [Git Adde +2. 
Since this holds for every e > 0, 


[Ade+| fade < | irAdae 


144 INTEGRATION [6.1 


It is proved similarly that 


[res Pade > PP Uirsode 


These two inequalities show that, if f,, fo € R (g; 4, 9), then f, +/2 € 
R (g; a, b) and 4 b b 
[ hrtoae = [fae] fade 


(ii) The proof of (a) is very easy, that of (6) follows from exercise 
6(a), 3. | 
Theorem 6.14. 
(i) Iff,, fo € R(g; a, b), then f, fo € R(g; a, 5). 
(ii) If inf |f(x)| > 0 and fe R(g; a, b), then 1/fe R(g; 4, 4). 
axzr<b 


Proof. 
(i) There is a K such that |f,(x)|, |A@)| < K for a<x< 5b. 


Then 
AMAM -AMRM| 
= |AMAW-AWMAO tAOLO -AOLO| 
< K|AM-AM|+K/AM-LO! 
and so, in any subinterval of [a, 5], 
sup fi(x) fox) — inf A) fa) 
—< K{sup f(x) —inf f\(x)} + K{sup fi(x) —inf A(x}. (6.17) 
Now, given e > 0, there is a such that 
SD, fi 2)-ND,fi,g)<¢ and S(Q,f 8)-XY, fe) < € 
and, by (6.17), 
S(F, fife, 8)-(F, fife» 8) 
< K{S(Q, fr, 8)— (QD, fry 8)} + K{SD, fas 8) - (I, fun 8} < 2Ke. 
ii) If inf OO) = k (> 0), we have 


fo)-f| . \fO)-F0| 
Twfe |< #2 


Wie ate ce 
flu) flv) 


ao 
— 


Hence 
SF, Uf, g)-(F, if, g)< (Uk) {SBF 8) -(F,F, g)} 


and the right-hand side may be made arbitrarily small. | 
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Theorem 6.15. If f € R(g; a, b), then | f| € R(g; a, 6) and 


Fae < [\flas. 


Proof. Since [| f()| — |F@)|| < |fW-SO)|, 
SQ, |f|,2)-sF, |fl.2) < SO.49)-(.S 8) 


and so | f| € R(g; a, 5). 
Also, since —|f| < f < |/|, it follows from exercise 6(a), 5 and 
theorem 6.13, (i) (a) that 


b b b 
-[Iflds < [fae < fis las. | 
b 
It is easy to give an example showing that | |f|dg may exist 


b 
without | fdg existing. (See exercise 6(a), 6.) 
Lemma. Ifa < b < ¢, 
fb fe fe 
@ | fae+| Fae = | Fas, 
b c c 
Gi) | fde+| "sae = | fae. 


Proof. We prove (i); the proof of (ii) is similar. 
Given e > 0, there are dissections Y,, Z., Z of [a, b], [b, cl, [a, c] 
respectively such that 


s(,) |" fag <e, S(Z,) ~| fag < 6, s(2)— |" fae < 6, 


Let &’ be the dissection of [a, c] containing all the points of division 
of J, 9, and &,. In particular, J’ has b as a point of division. Now 
let Di, D be the dissections of [a, b], [b, c] respectively induced by 


9’. Then S(D2) + (BD) = SB’ (6.18) 
and 


fo fe 
—e< | fae-s(@9 <0, -e< | fdg—S(22) < 9, 
a b 


O0< s(a')— |" fag < € 
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Adding these inequalities and using (6.18) we have 
6 fe fe 

= | fag + | fag | die << 
a b a 
Since this holds for all e > 0, (i) follows. | 


Theorem 6.16. 
(i) If fe R(g; a, b) and fe R(g; b, c), then fe R (g; a, c) and 
c b c 
[a= [ae fe 
(ii) If fe R(g; a, b) and if [c, d] is any subinterval of [a, b], then 
fe R(g; c,d). 
Proof. 
(i) By the lemma, 
c b c Te 
[ye = ref J 


(ii) Again, by the lemma, 
C d b b fe fd fb 
[pe face = [rte = [ee [st ra 


and so 


(fe) «(Tre [to Pref) - 


Since each of the three terms on the left is non-negative, each term 
is 0. | 


Notes. 

(1) By @), fe R(g; a, b) if f is piecewise monotonic, i.e. if [a, 5] 
may be divided into a finite number of subintervals in each of which 
f is monotonic. 

B 
(2) In view of the definition of | fdg when « 2 #, we need not 


assume in (i) thata < b < c. 


Theorem 6.17. (Change of variable.) Let fe R(g; a,b). If ¢ is a 
continuous, strictly increasing function on the interval [a, £| such that 


$(a) = aand $(f) = b, thenfo pe R(go 9; &, f) and 
[ fae = [Fo # ago). 
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Proof. The relation x = ¢(t) (x¢é[a, b],te[a, ]) sets up a bi- 
jection between the dissections J of [a, b] and the dissections Y* of 
[x, 6]. Also, if 2, Z* correspond, 


(2, f, g) = (D*, fog,g09), S(D,f, 9) = S\Z*, fo g, go 9) 
and therefore 


[@ = [(ve p)d(g0 4), [fas = Kc $)d(go 4). | 


Other simple properties of the Riemann-Stieltjes integral are 
given in exercises 8-10 below. 


Exercises 6(a) 


1. Show that, if J’ is a dissection of [a, b] which contains all the points of division 
of the dissection Y, then 


SD .f.2)< SD,f8), (W.fe) = (Q,f, g). 


2. Prove that, when k is a constant, 


ih (ifde = kf fas, | “Ofide = k | “fég (k> 0) 


Tb b b Tb 
[, (hae = kl fds, [de = kl fae k <0). 
3. Prove that 
Tb Tb Tb b b b 
| : fd(gi1+28s) =|. fae.+ | fdgo, ie fd(git+ 282) = { : fdg,+ | : fdge. 


4. Let fe R(g,; a, b)N R(g2; a, 6). Show that, if g,—g. is increasing, then 
fe R(ei—82; 4, 5) and 


b b b 
{ pet [ fae [ tae 
a a a 
5. Show that, if f(x) < f(x) for a < x < 5b, then 
Tb Tb b b 
[fae <[ fae, [fae <| fede 
a a Ja Ja 


6. Construct a function f such that, for any non-constant g, |f| € R(g; a, b), 
but f¢ R(g; a, 5). 


7. Find functions f, h, g (where g is increasing) such that 


Fo Fb Fo b b b 
| (f+ h)dg <| fae | hdg, { (f+ h) dg >| fae+ | hdg. 
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8. Suppose that f, he R(g; a, b) and h(x) = O for a < x < b. Show that there 
is a number A between the infimum and the supremum of f on [a, 5] such that 


b b 
| thdg = a{ hdg. 
a a 


(This result with h(x) = 1 is the first mean value theorem.) 


9. Show that, if fis continuous on [a, b] (and g is increasing), then 


b 
SQ, f, 8), (F, f, g) ~| fdg 
a 
as (D) > 0. 
10. Let fe R(g; a, b) and define the function F on [a, 5] by 


F@) = [20 (a= x = Bb). 


Prove that 
(i) F is continuous at every point of continuity of g; 
(ii) Fis differentiable at a point where fis continuous and g is differentiable 


and, at such a point x, ae : 
. F(x) = f(x)8'@). 


11. Let g be increasing on [a, b]. Show that, if f is bounded on [a, 5] and 
b b 

I fdg exists whenever 0 < h < b—a, then | fdg exists. 
ath a 


12. A function ¢ defined on an interval containing the point c is said to be /eft 
continuous at c if d(c—) = Pc), ie. 


fe 8 A(x) = P(C). 


Right continuity is similarly defined. 
Let f be bounded and g increasing on [a, b]. Prove that, if f and g have a 
common left discontinuity or a common right discontinuity, then f¢ R(g; a, b). 


13. Suppose that, for n = 1, 2, ...,f,¢ R(g; a, b) and that /, > /f uniformly on 
[a, b]. Prove that fe R(g; a, b) and 


b b 
[ fade > [fae 
a a 
as n -> ©, 


14. Show that, if f is non-negative and continuous on [a, 5] and g is strictly 
increasing, then 


({ fede)” > sup f(x) 
axax<xb 


6.2. Further properties of the Riemann-Stieltjes integral 


The material in this section is rather more sophisticated than~tffat 
of §6.1. Familiarity with it is not necessary for a reader who is 
primarily interested in the Riemann integral. We begin by showing 
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that, if the integrator has a Riemann integrable derivative, or is an 
indefinite Riemann integral, then the RS integral reduces to an 
ordinary Riemann integral. : 


Theorem 6.21A. Let fe R(a,b). If the increasing function g is 
differentiable in [a, b] and g’ € R (a, b), then fe R(g; a, b) and 


[ reads = ['seoe'eoax 


Proof. If the theorem has been proved for non-negative f and so, 
in particular, for a non-negative constant, it holds for any fe R (a, b). 
For we can then write f = (f+ C)-—C, where C is such that 


f(x)+C 20 


fora < x < b. We may therefore suppose that fis non-negative in 
[a, b]. Since g’ is also non-negative, in any subinterval of [a, 5], 


inf f(x) inf g’(x) < inf f(x)g’(x) < inf f(x) sup g(x). (6.21) 


As g', fg’ = R(a, 6), a now familiar argument shows that, given 
é > Q, there is a dissection Y of [a, b] such that the inequalities 


S(J, g')—S(Q, 2’) < «, (6.22) 
S(Q, fe')— (QB, fe’) < € | (6.23) 
both hold. If F is given by 
62% 2a Si. x pS ep =D, 


denote the infima of f(x), g’(x), f(x)g’(x) in [x;_1, x,] by m,, mi, D; 
and the supremum of g’(x) by M;. By (6.21), there is a number yp; 
such that m; < uw, < M; and 

Py = M%- 


Also, by the mean value theorem, there is a number ,; such that 
mM; < “4; < M;, and 


8(X:) -—9(%i-1) = Mi — X;-1). (6.24) 
Then, by (6.23), 


b n n 
| f Os * DAX%i— Xj) = Py My bei, + (My — Hi)} (%; — X41) 


= bs m,{g(x;) — g(x;_1)} is = m,(L; — a) (x; —X; aut 
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But, if M= ease T(x), 
then, by (6.22), 


7 x Mil; — f;) (X%;— X;-)| < M a (M;—m;)(x;—%;-1) < Me 


and so [2 < (2,f,z)+(M+le < |” fg + (M+ Ie, 


This holds for all ¢ > O and therefore 


[fe < |. fe. 


It is shown similarly that 


b fb 
| fg’ > | fdg. | 


Theorem 6.21B. Let f, g © R (a, b) and let G be an indefinite integral 


of g, i.e. : 
G(x) = | g(tidt+K (a<x< Jb), 


where K is a constant. If g is non-negative on [a, b] (so that G increases), 
then f € R(G; a, b) and 


b b 
| : f(x)dG(x) = [ f(x) 2(x) dx. 


Proof. The previous proof with g’ replaced by g, and g replaced by 
G applies almost word for word. The only difference is that the 
analogue of (6.24), which reads 


G(x,) ae G(x;-1) et L(x, ae Ke is 


is obtained by remarking that G(x;) — G(x;_,) lies between m;(x;— x;_}) 
and M;(x;—x;_,). (Alternatively, see exercise 6(a), 8.) | 


It will be seen later (in theorem 6.84) that, if g’ is R-integrable, 
then g is an indefinite integral of g’. Thus, theorem B in fact includes 
theorem A. 


Having disposed of smooth integrators we — to the consideration 
of discontinuous ones. 

The real function ¢ on the interval [a, b] is said to have a jump 
discontinuity at the point x in (a, b) if d(x—) and ¢(x+) exist, but 
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are not both equal to ¢(x); the definition is appropriately modified 
for x = aor x = Db. Also, as in exercise 6(a), 12, we call ¢ continuous 
on the left [right] at x if d(x—) = (x) [6(x+) = G(x)]. Since an 
integrator g on [a, 5] is increasing, any discontinuities of g must be 
of the jump variety. The simplest discontinuous integrator is a 
function which is constant except fora jump at a single point. If the 
jump is not at an end point, then the function is of the form 


= tore = x.< 4, 
g(x) = 


B feru< x <b, 
with «a < f and a < g(u) < f. Otherwise the form is 


«a forx=a [% fora = x <5, 


a =| Or 1, fo 


Stet a = xs b 


Theorem 6.22. Let g on [a, b| be constant except for a jump at the 
point u. Then fe R(g; a, b) if and only if f is continuous on the left 
when g is discontinuous on the left and continuous on the right when g 
is discontinuous on the right. If f € R(g; a, b), then 


(fl {e(a+)-8@} w= a), 
[ fas = | fe){eu+)-su-)) @<u<d, 
fO){g)-ab-)} wu =D). 


Proof. When f, g have a common discontinuity on the left or on 
the right, then, by exercise 6(a), 12, f¢ R(g; a, b). 

Suppose now that there is no such common discontinuity. 

First let a < u < b. We shall show that fe R(g; a, u) and that 


[fae = f0d{g-su-) (6.25) 


This clearly holds when g(u) = g(u—), i.e. when g is constant on 
[a, ul. When g(u) > g(u—), take ¢ such that a < t < uand let DZ be 
the dissection of [a, u] with a, t, u as the sole points of division. Then 


inf f@){e)-8@)} = iF) < [Fas < [as 


< SM) = sup fo) {su —g(0}. 


Since f 1s now continuous on the left at u, the first and last of these 
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expressions tend to f(u){g(u) —g(u—)} as t > u—. Thus fe R(g; a, u) 
and (6.25) holds. 

When a < u < 5, we prove similarly that fe R (g; u, b) and 


[fas = feo fou+)—s(u). | 


Lemma 1. Any increasing function is the sum of a continuous increasing 
function and of countably many functions each of which is increasing 
and is continuous except for a single jump. 


Proof. Let g be an increasing function on [a, b]. By exercise 1(c), 
10, the set of discontinuities of g is countable. Denote the points of 
discontinuity by u,, where k ranges over a countable set A which 
may be finite, possibly empty. If a < u, < b, define the function g, 


by 0 fora < x < %, 
g(x) = \ 8x) -8(u—) for x = Uz, (6.26) 
g(u,+)—g(u,—) foru, < x < b. 
To define g;, when uw, = a take g(a—) = g(a) and omit the first line 
of (6.26); when u, = b, omit the last line. When A is infinite, Dg, 


converges uniformly on [a, 5], for Xg,(b) converges (the partial sums 
being bounded by g(b) —g(a)) and 


0 < g(x) < gb) <x < d). 
Hence, whether A is finite or infinite, the function 


J es bd Sk 
keA 


(taken to be identically 0 when A is empty) is continuous except at 

the points u, (k € A). Thus a 
is continuous at all points x + u,(k ¢ A). But, by the definition of 
Sx» & — Fy 1S continuous at u;,. Since g, (i¢ A, i + k) is also continuous 
at u,, it follows that g* is continuous on [a, 5]. 

The function g* is increasing. To prove this we first note that the 
intervals J, = [g(u,—), g(u,)], J, = [e(u,), g(u,+)] are non-over- 
lapping. Now let a < « < # < b. Then the sum of the lengths of any 
finite number of /,, J, contained in the interval [g(a), g(f)] is less 

-than or equal to g(/)—g(«). The same is therefore true of all the 
I, J;, in [g(«), g(A)]. Thus 2 18i(2)— 8} < g(B)—g(@), ie. 


g*(x) < g*(A). | 
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Given the increasing function g, the functions g* and / defined in 
the proof of the lemma are called, respectively, the continuous 
component and the jump component of g. To obtain the decomposition 
of { fdg corresponding to this resolution of g we need another 
lemma. 


Lemma 2. Suppose that the functions ¢.. g,, ...on[a, b] are increasing 
and that > g,, converges to a function h on [a, b]. Then h increases on 
k=1 
(a, b] and, if f ¢ R(g;,; a, b) for every k, then fe R(h; a, b) and 
b co b 
| far = & | Vee. (6.27) 
Proof. lfa<a< B < b, g,(f)—g,(«) 2 0 for every k and so 


W(A)—Ma) = 3 {ex(A)—eul@)} > 0. 


Thus / increases on [a, 5]. 
Now suppose that fe R(g;; a, b) for every k. Let 


K = sup |f(x)|. 
axx<b 


b 
Then, since [is dg,| < K{g(b) —g:(@)}; 
© b 
it follows that = | dg, 
k=l1Ja 
converges. 


Given € > 0, there is a p, such that, for p > Do, 


5; {g,(b)—g,(a)} < ¢. 
k=p+1 


Since, for every dissection Z of [a, 5], 


(ZF, gx)|, |SBSA an)| < K{gi(b) —2{a)}, 


when p 2 Po; 


CAA 


k=p+1 


: S a9 600) 4 Ke. (6.28) 
k=p+1 


Take any p > py. Then there is a dissection Y* such that 


ESD 8)- OL a} < 63 
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and, using (6.28), we have 
Pp b Pp 
3 | fdee-(K+ Ne < B s(D*f,a1)—Ke 


b ob 
< 19°, fh) < fdh<«< | fan < S(D*,f, h) 
wv a . 
Z = So": fej tk < = | fd, + (K+ Ne. 
Thus, for any p > Do, 
p bd b bd p b 
= | fie, (he Ve < | Saha | far < 3 | fde,+(K+1)e 
and so | 
oO fb b fb oO fb 
: | fie, 2 Dee | fdh < | fie > | fie: Ce Be. 
=lJa Ja a k=1Ja 
Since this is true for every e > 0, fe R(g; a, b) and (6.27) holds. | 


Theorem 6.23. Suppose that g is an increasing function on [a, b] with 
discontinuities at the points u,(k € A). Let g = g*+j, where g* is the 
continuous component and j is the jump component of g. If, now, 
fe R(g; a, b), then fe R(g*; a, b) and 


[ fae = [ faet+ 3 Feu) {e(ue+)—e(ue-)) 


with the convention that g(a—) = g(a), g(b+) = g(b). 


Proof. Let the g, (k € A) be the functions defined by (6.26) so that 
j = 2g, Since fe R(g; a, b), f and g can have no common left or 
common right discontinuities. Hence, by theorem 6.22, for each 
ke, fe R(g;,; a, b) and 


| f dgy, = S(ux) {gee +) —8x(ue—)} = Fux) {e(u, +) —2 (up —)}. 
It now follows from lemma 2 that fe R(j; a, b) and 
[r G= X fu)istet)—8Ux—)}. 
Since g* = g—/ is increasing, by exercise 6(a), 4, fe R(g*; a, b) and 


[fas = [fast + [ ra | 
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It is the last theorem that makes the Riemann-Stieltjes integral 
particularly valuable in applications of analysis. It shows, for 
instance, that in mathematical physics this integral can cope with 
point masses as well as continuous mass distributions. 

We can now also improve theorem 6.12(ii). 


Theorem 6.24. If f is monotonic and g is increasing on [a, b| and 
if f,g have no common left or common right discontinuities, then 
fe R(g; a, 5). 


Proof. Let g = g*+/, where 7 = Xg,. In view of the restriction on 
the discontinuities of fand g, theorem 6.22 shows that fe R (g;; a, 5) 
for all k. Therefore, by lemma 2, fe R(j; a, b). Also, by theorem 
6.12 (ii), fe R (g*; a, b). | 


If, in theorem 6.24, f (as well as g) is taken to be increasing, then 
fe R(g;a, b) and ge R(f;a, b). The symmetry of the situation is 
further exhibited by the formula for integration by parts. 


Theorem 6.25. (Integration by parts.) If the functions f and g are 
increasing on [a, b] and have no common left and no common right 
discontinuities, then 


J J dg +[°2 df = f(b)g(b)-f@s@). 


Proof. Since f and g are increasing functions, we have, for any 
dissection D given by a = X5 < X, <...< Xn-1 < X, = 3B, 


S(D,f, 8) + (QF, gf) 
. 3 F(x) {g(%:) —8(%i_-D} + 2, a(x:-D{S(~%) —f(%i-v} 
= f(b)2(b) —f(ag(a). (6.29) 


But, given e > 0, Z may be chosen so that 


S(O,f,8)-6 < | fag < SOL, 8), 


(FD, g,f) < [ se < 42, 9.f)+€. 


Adding these two inequalities and using (6.29) we have, for all 
e> 0, 


fWa0)-Sasta-e < | fas+| gat < fOe®)-fasta+e. | 
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Exercises 6(5) 
1. Show that, if fis continuous on [0, 00), then for x > 0, 


Z fi) = [ ” Fdltl, 


1<n<x 


where [f] is the greatest integer less than or equal to t. (When 0 < x < 1, the 
sum on the left is taken to be 0.) 

co 
2. The series >) u, converges absolutely. Find an increasing function g on [0, 1] 


n=1 
and a function fe R(g; 0, 1) such that 
Sw = [i fae 


3. Construct functions f, g, (n = 1, 2, ...) on an interval [a, b] with the following 
properties: 

(i) Forn = 1, 2,..., 2, increases on [a, b] and fe R(g,; a, 5), 

(ii) gn >g on [a, 5], but f¢ R(g; a, 5). 


4. The functions g,,(n = 1, 2, ...) are increasing on |a, b] and g, > g uniformly 
on [a, 5]. Show that, if f increases on [a, b] and fe R(g,; a, 6) for every n, then 


fe R(g; a, b) and b > 
[ faazes i file 


6.3. Improper Riemann-Stieltjes integrals 

Improper Riemann integrals were defined in C1 (138). The 
definitions for Riemann-Stieltjes integrals are entirely analogous. 
Definition. Let g be an increasing function on [a, 00). If fe R(g; a, X) 


xX 
for every X > aand lim | /fdg exists, then this limit is called an 


xX>oJ a 


improper Riemann-Stieltjes integral of the first kind and is denoted by 


[a 


Improper integrals have properties like those of infinite series and 
the following theorems illustrate this. The proofs are left to the 
reader. 


Theorem 6.31. (General principle of convergence.) If fe R(g; a, X) 
for every X > a, a necessary and sufficient condition for | fdg to 
exist is that, given é > 0, there is an X, > a such that 


Jef 


<é whenever X, > X, > Xo.. 
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For non-negative integrands the first comparison principle is the 
mainstay of proofs of convergence or divergence of integrals. 


Theorem 6.32. Suppose that f, ¢, ¥ € R(g; a, X) for every X > a. 
(i) If d(x) > f(x) 2 0 for x > aand [- $e exists, then + fdg 
exists. ; — : 
(ii) If f(x) = (x) 2 0 for x > a and | : dg does not exist, then 
z fdg does not exist. 
“When fis of variable sign (and fe R(g; a, X) for every X > a), 


the simplest criterion for the convergence of | fdg is the convergence 
a 


of | | | de’ (see exercise 6(c), 2). This result motivates the definition 
of absolute convergenice. 

If fe R(g;a,X) for every X >a and | |\f|dg exists, then 
| fdg is said to converge absolutely. If | fdg exists, but | | f| dg 
does not, then | fdg is said to be non-absolutely (or conditionally) 
convergent. 


Illustrations. These refer to the simplest and most important case, that of the 
ordinary Riemann integral. 


4s a x : 
(i) | x2 dx converges absolutely, since 
1 


sin x 
< 
x? 


[eo @) 1 : 
and { — dx exists. 
1 <z 


- - Peay x 
(ii) | sng dx converges non-absolutely. 
1 


Xo X x 
3 sin x cos x |- cos x 
First — dx = | =] ~ | : dx 
1 x x 1 1 x 


As X > 0, the first term on the right tends to cos 1. The second term also con- 
; © sin x ; 
verges, since |cos x| /x? < 1/x?. Hence { — dx exists. 
ae 


sin x 


dx diverges, note that 


(n+1)7 |sin x| 1 tA a) 
dx 2 sin x| dx = ————. 
a x (n+1)7 Jnn | | (n+1)7 


lo 6) 
To prove that { 
1 
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The other two kinds of integrals with infinite range need not detain 


us. The definition of i fdg is obvious. | : fdg is defined as 


[fae {“Fae: 


the sum of these two integrals is clearly independent of a. 


The second kind of improper integral is treated in exactly the same 
way as the first kind and we do little more than give the definitions. 


Definition. Let g be an increasing function on (a, b]. If 

SER(g; ath, b) 
whenever 0 < h < b—a (so that f is bounded on every interval 
[a+h, b]) and if ave | . dg exists, then this limit is called an improper 


Riemann-Stieltjes integral of the second kind and we denote it by 
b b 
| fdg, or simply | dg. 
a+ a 


Notes. | 
(1) If f is bounded on [a,b] and fe R(g;a+h, b) whenever 
0<h < b-a, then fe R(g; a, b). (See exercise 6(a), 11.) Hence 


b 
the definition of | fdg is likely to be invoked only when / is 
a+ 
unbounded near a. : 
(2) Suppose that fe R(g; a,b). Then i fdg exists, but the 
a+ 


improper integral need not be equal to the ordinary integral. (See 
exercise 6(c), 6.) The two integrals do coincide when g is continuous 
at a and so, in particular, in the case of the Riemann integral. (See 
exercise 6(a), 10(i).) 


b- b 
The definition of fdg (or simply i fdg) is analogous 
> a a 
to that of | fdg. Also, if a= upg < Wy <...< uy, <u, =b 
a+ 


and. for i= 1,44 | fdg exists as one of the improper integrals 
UWi-1 
just defined, we call (unambiguously) the sum of these integrals 


b 
the improper integral | dg. 
a 


6.3] IMPROPER RIEMANN-STIELTJES INTEGRALS 159 


The two kinds of improper integral may occur together. For we 
20 | x x 

may define | fdg as lim fdg when | fdg exists either as an 
a: xo Ja a 

ordinary RS integral or as an improper integral of the second kind. 


Series of improper integrals. We‘are once again concerned with the 
reversal of two limiting processes. 


Theorem 6.33. Suppose that g is ‘an increasing function on [a, ©), 
that the functions f,,(n = 1, 2; ...) are all non-negative on [a, 00) and 


 R( fina). (Zale an 


exist and are equal for every X > a, where the integrals are ordinary 
or improper. If, now, one of the expressions 


> ({" ude) f ( 3 fn) dg (6.32) 


exists, then so does the other and the two are equal. 


Proof. Suppose that the first expression in (6.32) exists. Since the 
version of theorem 4.73 with one continuous and one integral 


N fx 
variable is applicable to & | f,,dg, and since the two expressions in 
n=l1Ja 


(6.31) are equal, we obtain the following chain of equalities in which 
the existence of any term implies that of the next. 


5 (finee) =, 


(sim ‘Zz fd] a ( [_ fade) 


x—o n=1 


lim “(3 fala = & fra) | 
xX—-ao Ja a \n=1 

In most applications of theorem 6.33 the interchangeability of 
summation and integration in (6.31) is ensured by the uniform con- 
vergence of X/,. 

It is evident that theorem 6.33 may be restated in terms of improper 
integrals of the second kind. 


Uniform convergence. The notion of uniform convergence may be © 
applied to improper integrals of both kinds. It is sufficient to con- 
sider the first kind. 
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Definition. Suppose that the function g on [a, 0) increases, that E is 
a set (not necessarily equipped with a metric), and that f is a real- 
valued function on [a, 00) x E. If 


[40 dso (6.33) 
exists for every y € E and if 


>0 as X>O, 


| ” (2, y)de(x) 
2.4 


then the integral (6.33) is said to converge uniformly on E. 

The theorems of §5.3 may be recast so as to apply to improper 
integrals. The analogues of the most important of these (5.31 and 
5.32) are easily obtained; the rest have to await further develop- 
ments. (See §6.9 and exercise 8(g), 1.) 


sup 
YEH 


Theorem 6.34. (General principle of uniform convergence.) Suppose 


b 7 
that | T(x, y)dg(x) exists whenever b > a and yé& E. A necessary and 


sufficient condition for | I(x, y)dg(x) to converge uniformly for ye E 


is that, given e > 0, there is an X such that 
x, 
[_ fos devo 


Proof. The necessity of the condition is obvious. To prove the 


<é whenever X, > X, 2 X. 


sup 
yceH 


sufficiency we first note that, by theorem 6.31, | = T(x, y)dg(x) exists 


for every y EE. 
Now consider a particular y € E. Since 


Xe 
| Sx, yde(x) 
re 


<e Whenever “XX, > 74,24. 


and since | u T(x, y)dg(x) exists, it follows that 
= 


< é. 


J, ses deo 
x, 
But this holds for every y € E and therefore 


2 I(x, y) de(x) 
Xi 


<e for 4,3 2.) 


sup 
YEH 
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b 
Theorem 6.35. (Weierstrass’s M-test.) Suppose that | F(x, vydg(x) 
exists whenever b > a and y€E. If there is a function M on |a, ©) 


such teat it) <M) for a exe Oy EL 


and | ° M (x) dg(x) converges, then | St (x, y) dg(x) converges (absolutely 


and) uniformly on E. 
The proof follows almost immediately from theorems 6.31 and 
6.34. 


Illustration. If c is any positive number, 
co 
{ e~*” sin x dx 
0 
converges uniformly for y > c. To prove this we can use the M-test, for, when 
x2>0Oandy2ec : 
5 ee le-v sin x| < e-*, 


Theorem 6.33 shows that the convergence is not uniform for y > 0, since, if 


is an integer, 
(n+1)7 ; 
[ e-* sin xdx 


nn 


sup 
y>0 


> sup (2e-@+ 0) = 2, 
y>O0 


In §8.7 we shall discuss functions ¢ defined, by means of uniformly 
convergent integrals, in the form 


869) = | fle ye. 


Exercises 6(c) 
1. Prove theorem 6.31. 


foe) ce 
2. Show that, if fe R(g; a, X) forevery X > a and | f| dg exists, then [ fSdg 
a a 


exists. 


© sin 
3. Show that { de 
2g log x 
exists. 
le @) 
4. Prove that { cries 
1 xt+sinx 


converges non-absolutely. 


5. Prove that, for any constant a, 


(n + je? es nett 


7 >ati 
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as n > 0, Deduce that, if fis any positive decreasing function on [1, 00), then 


Si fly cad | aide 


n=1 1 
both converge or both diverge. (The case a = 0 is the Maclaurin—Cauchy 


integral theorem proved in C1, 141.) 
b 
6. Let fe R(g;a,b). Show — lim i fdg exists, but that, if g is discontinuous 
—0+ Jath 

at a and f(a) + 0, then 

b b 
lim fdg + i fdg. 

ath 


h—-0+ a 


© 106 2% 
7. Eval ; 
valuate [Le G+ 1 dx 


8. Find the values of «, @ for which 
(oe) a 
| = 
0 1+x? 


9. Let c > 0. Show that 2 e168 C1) 
n=0 


exists. 


converges uniformly in every interval [a, 4] such that O<a< f# <1. By 
applying twice the analogue of theorem 6.33 for improper integrals of the second 


kind prove that Se dacs, f rors = 
n=o(ut+c)? Jo 1—x : 
foo) 
10. Show that [ ely dx 
0 
converges uniformly on any interval (0, Y], but not on (0, 00). 
(ee) 
11. Prove that I cine 
9 x+sin y 
converges uniformly on (— ©, ©). 
eee 5 
12. Show that [, 1+ x*y2 dx 


(i) converges for —0 <y< @; 
(ii) converges uniformly for |y| > c > 0; 
(iii) does not converge uniformly for —0co < y < ©, 


6.4. Functions of bounded variation 

Our interest in these functions is twofold. First, they may be used 
as integrators in Riemann-Stieltjes integration. Secondly, there is a 
natural bond between them and rectifiable curves, i.e. curves of finite 
length. (See §8.1.) 
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Let f be a real function on the finite interval [a, b]. If D is the 
dissection of [a, b] given by | 


@ eX KX Se KX < Mn = 9, 
put V(9,f) = & \fed-feea)| 


Definition. If V(Z,f) is bounded for all dissections D of |a, b}, the 
function f is said to be of bounded variation on [a, b] and 


vicf) = sup VQF,f) 
77] 


is called the total variation of f on [a, 5]. 


Notes. 
(i) If f is monotonic on [a, b], then f is of bounded variation on 
, b] and 

Jas Vif) = |fO-S@) 


(ii) If f is continuous on [a, b] and has a bounded derivative in 
(a, b), then f is of bounded variation on [a, b]. (See exercise 6(d), 1.) 
(iii) A function of bounded variation is bounded. 
(iv) (a) A function of bounded variation need not be continuous 
(since, for instance, a monotonic function need not be continuous). 
(b) A continuous function need not be of bounded variation. 
For let f be given by 


1 
fd) = x COS 5— (x + 0), 
0 (x = 0). 


Then f is continuous everywhere, but is not of bounded variation on 
any interval including the origin. Consider, for instance, the interval 
[0, 1]. If G,, is given by the points 


1, 
>On’ 2n—1’? 2n—2’ 2n—3’ 3’ 2’ ” 
Sees | 1 ] Seek 
Gee VOWS) = st a 3! m0 19a? 
s- 
te 2 eco n—1 n 


Therefore V(Q,,, f) > 0 asn—> oo and sof is not of bounded varia- 
tion on [0, 1]. 
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It is easily shown that elementary operations on functions of 
bounded variation yield functions of bounded variation. 


Theorem 6.41. If f, g are of bounded variation on [a, b], then so are 
\f|, f+2, fg. If, inaddition, inf |f(x)| > 0, then 1[f is also of bounded 
variation. — 


Theorem 6.42. 

(i) If f is of bounded variation on [a, c| and ifa < b < c, then f is 
of bounded variation on [a, b] and on [b, c}. 

(ii) If f is of bounded variation on [a, b] and on [b, cl, then f is of 
bounded variation on [a, c] and 


Vif) = Vif) + Vif). (6.41) 
Proof. 
(i) Let D,, Z, be any dissections of [a, 5], [b, c] respectively and 
let DZ, be the corresponding dissection of [a, c]. Then 
V(Df)+VOnf) = VDof) < Valf). 
Therefore fis of bounded variation on [a, b] and on [b, c] and also 
Val) +V5P) < Via(f). (6.42) 


(ii) Let D be any dissection of [a, c] and let Y* be D with b as an 
additional point of division if it is not already a point of Z. Let Y’, 
Y" be the dissections of [a, b], [5, c] induced by Y*. Then 


VBS) < VO) = VALA)+VGS) < Vaf)+ Vif). 
Hence fis of bounded variation on [a, c] and also 
Va) < Va(f)+ Vi). (6.43) 
(6.42) and (6.43) give (6.41). | 


We can now establish a simple alternative characterization of 
functions of bounded variation. 


Theorem 6.43. A function is of bounded variation if and only if it is 
the difference of two increasing functions. 


Proof. An increasing function is of bounded variation and, by 
theorem 6.41, the difference of two such functions is also of bounded 
variation. 
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Now suppose that fis of bounded variation on [a, 5]. In view of 
theorem 6.42, the variation function v, defined on [a,b] by the 


equations 
0 for ¥ = a, 
v(x) = 
Pf) fora < x < 6, 
exists and is increasing. Next put 


Wy = U-f. 
Whena<x<y< BJ, 


{u,(y) — v(x} —(f() —F)} 
Vuf)-{f0) —f()} 2 0. 


Therefore w, increases on [a, b]. Also f = vs—Wy. | 


wy(y) — W(x) 


Corollary. Any discontinuities of a function of bounded variation are 
jumps. Consequently the set of such discontinuities is countable. 


If u is any increasing function, then (in the notation of theorem 
6.43) v,;+u and w,+u are increasing and f = (v;+u)—(ws+u). So 
the representation of a function of bounded variation as the differ- 
ence of two increasing functions is not unique. 

Theorem 6.43 shows why functions of bounded variation may be 
made to serve as integrators. The next theorem is also relevant to 
Riemann-Stieltjes integration. 


Theorem 6.44. Let f be of bounded variation on [a, b|. Then v, is 
continuous on the left (right) at the point & in [a, b] if and only if f 
is continuous on the left (right) at &. 


Proof. We consider left continuity at a point £ such thata < £ < b. 
First suppose that v, is continuous on the left at €,° ie; that 
v,(x) > v,(£) as x > €—. Since, fora < x < §, 


If)—-SO)| < Vif) = 0,(6) — v(x), 


it follows that f(x) > f(§) as x > §-. 
Now suppose that f is continuous on the left at &. If ¢, # are 
increasing functions such that f = d—y, then 


{o(£) — g(E-J}- YH)—WE-)} = FEE)-F(E—) = 9, 
and so g(E)- d(E-) = W(E)-W(E—-) =k 2 0. 
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Define the functions ¢,, %, on [a, b] by 
O(x)t+k (a<x< §), 
p(x) = 
P(X) (§ <x < dD), 
W(xy+k (a<x< S), 
W(x) (2 xe. 8). 


Then ¢,,¥%, are increasing functions on [a,b], f= ¢,—%,, and 


PilS—) = O1(5), ¥ilE—) = Wy(E). Also fora < x < &, we have 
0 < 0,(6)—v,(x) = Vif) < Vib) + Vay) 
= {1(E) — Pi()} + YS) — Y)}. 


Since the right-hand side tends to 0 as x > —, v,(£—) = v,(&). 
Right continuity at a point € (a < & < 5b) is dealt with similarly. | 


Yri(x) = | 


Corollary. Let f be of bounded variation on [a,b]. Then the two 
increasing functions v,, Ws (which are such that f = v;—wy) are con- 
tinuous on the left (right) wherever f is continuous on the left (right). 


If f is of bounded variation on [a, b], define the functions p, and 
gy on [a, b] by 


Py = Hv,+f-f@}, a = Ho,-f+S(@}, 
sothat pa) =q,(a)=0 and p,-q=f-f@. (6.44) 


It is easily seen that p, and q; are increasing functions. Also, by 
theorem 6.44, p; and qg; are continuous on the left (right) wherever 
f is continuous on the left (right). The functions p, and q; are called, 
respectively, the positive and negative variation functions of f. (Exercise 
6(d), 3 supplies the reason for this nomenclature.) These functions 
enable us to characterize all representations of a function of bounded 
variation as the difference of two increasing functions. We first show 
that p,; and g,; are the smallest increasing functions satisfying (6.44). 


Lemma. Let f be of bounded variation on [a, b]. If r, s are increasing 
functions on [a, b] such that 
r(a) = s(a)=0 and r—-s =f-f(a), (6.45) 


then r(x) > p;(x) and s(x) > q,(x) fora < x < b. 
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Proof. Take x such that a < x < band let Z be any dissection of 
[a, x]. Then 


V(Z.f) = = (f(x) -fO%i-d| 
= & |{r(x;)—r(%i-)} — {s(%:) — 5%-»)} 
< 2 {r(xi)-7 wD} + (5%) — 8%} 
= r(x)+5(x). 
Since this holds for all dissections of [a, x], 
py(x) +(x) = v(x) = VI(f) < r(x) +s(2x). 
It now follows from the second equations in (6.44) and (6.45) that 
pAx) < r(x) and q(x) < s(x). | 


Theorem 6.45. Let f be of bounded variation on [a, b]. If u is an increas- 
ing function on [a, b] and u(a) = 0, thenr = p;+uand s = qy+u are 
increasing functions on [a, b] such that 


r(a) = sa) =0 and r-s =f—-f(a). 


Also every pair of increasing functions r,s satisfying these equations 
is of the form r = ps+u, S = g;+u, where u increases on [a, b] and 
u(a) = 0. 


Proof. The first statement is obvious. To prove the second, take 
any points «, # such that a < a < # < band put 


pt = Ds—D(), g* = G5 —9(%), r* =r—r(a@), s* = s—s(a). 


Then the restrictions of p*, g* to [a, f] are clearly the positive and 
negative variation functions of f on [«, /] and 


r*(a) = s*(a)=0 and r*¥—st* = f—f(a). 
Therefore, by the lemma, p*(f) < r*(f), 1. 
r(P)—p(P) 2 r(%) —p,(@). 
But «, # are arbitrary and so u = r—py = S—q; increases on [a, 5]. | 
Exercises 6(d) 
1. Prove Note (ii). 


2. Prove theorem 6.41. 
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3. Let fbe of bounded variation on [a, b]. For the dissection F of [a, b] given by 
Aa=XxXo9< xX, ca — Xq 4 SH Ay = 8, et 


POS) = 5 Slfxd feed) +60) Soa, 
OD.) = 5 SIF) Flr) - UV) Soa) 


(so that PP, f), Q(D,f) are the sums of the terms | f(x,)—f(x,_,)| for which 
f(x:)-—f(%i:_1) = 0 and < 0 respectively). Let 


P*(f) = sup PY, f), Q2(f) = sup O(F,f), 


where each supremum is taken over all dissections of [a, 5]. 
Prove that 


PUP) + OMA) = Vi) and Pi(f)- Of) = fO@)-f@ 
and deduce that, fora < x < b, 
Pf) = pfx) and QO%(f) = gfx). 
4. Let fe R(g; a, b) and define the function F on [a, b] by 


F(x) = [tae. 


Prove that fis of bounded variation on [a, b] and that 
van) = [ iflae. 
Show also that, if ft = 4(|f| +f), f- = 4(|f| —S/), then 
pany = [rds on = | Frav. 
5. Construct functions f, (n = 1, 2, ...) on an interval [a, b] such that 
(i) (f,) converges uniformly on [a, 5], 


(ii) each f, is of bounded variation on [a, 5], 
Gii) lim f, is not of bounded variation on [a, 5]. 


6.5. Integrators of bounded variation 


A function g of bounded variation can be expressed in the form 
g = r-—s, where r and s are increasing functions. It is therefore 


natural to define b b b 
[ fas = [ far-[ fds 


when both the integrals on the right exist. But the expression of g 
as a difference of increasing functions is not unique and so we have 


b 
to show that our definition of | dg is independent of the particular 
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pair r, s. Let r,, s, be another pair of increasing functions such that 
| | fdr, | fds, 
a a 


both exist. Then, since r+s, = 7, +5, theorem 6.13 gives 


[far+ | fas, = [fans | fas, 


which is what was needed. 
b b 
When f is continuous, | fdr and | fds exist for every pair /, s. 
a a 


But when / has a discontinuity, then, even if fis integrable with 
respect to some pair r,s, there exist others with respect to which it 
is not integrable. For, given r and s, the increasing function u may 
be so chosen that r+u and s+u have a discontinuity at the same 
point as f. We therefore call f integrable with respect to g if there is 


b b 
some pair r, s such that [ fdr and | fds both exist. 


The functions Po» Vy are such that p,—g, = g—g(a) (not g). Never- 
theless it is clear that, if fis integrable with respect to p,, g,, then fis 
integrable with respect to g and 


b b b 
[ fas = | far,-| fate (6.51) 
for p, + 2(a), q, is a decomposition of g into increasing functions. The 
next theorem shows, conversely, that, if fis integrable with respect 
b b b 
to g, then | Sdp, and | dq, exist so that | fdg may, in fact, be 
defined by (6.51). 


Theorem 6.51. Let g be of bounded variation on the interval {a, b\. If 
fis integrable over [a, b] with respect to g, then f is also integrable with 
respect to each of the variation functions V,, Dy, Ig Of &- 


Proof. Let r,s be two increasing functions on [a, b] such that 
g =r-—s and f is integrable with respect to each of r,s. Given 
e > 0, there is a dissection D of [a, b] such that 


SZ, f,r)-\Z,fr)<¢ S(D,f,8s)-(D,f, 8) < € (6.52) 
and V(Z, g) > V2(g)—e = v,(b)—«. . (6.53) 
Since |g(2) —a(@)| < [r(2) —r(@)] +[s(4) — 5()], 
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the two inequalities of (6.52) give (in the usual notation) 
2 (M;—m,)|g(x;)—g(%i-)| < 2¢. (6.54) 


Also, if K = sup |f(x)|, then by (6.53), 
b 


aSLS 


O< > (M;—m,) {[v,(x;) —0,(%;-)] - Fy (x;) —3(x;-)]} 


< 2K = {[vg(i) — 0,(%s-1)] — |g) —20%:-D)]} 


= 2K {v,(b)-—V(Q, g)} < 2Ke. (6.55) 
From (6.54) and (6.55) we now obtain 


E (M—m) (o,(x) v(x} < AK+I)e 


and so fe R (v,; a, 5). 

As p, = }{v,+g-—g8(a)} = t{v,+r—s—g@} is an increasing 
function, exercise 6(a), 4 shows that fe R(p,;a,b). Similarly 
fe R(q,; a, b). | 


Most of the theorems on Riemann-Stieltjes integration with an 
increasing integrator are easily extended to integration with respect 
to a function of bounded variation. Where there are differences, 
integrators of bounded variation usually lead to a more symmetrical 
result. 

The integral is linear in the integrator as well as the integrand: If 
k,, k, are any constants, 


[Gifithfdde =k [ fidetke| fide 6.56) 


“Pb b b 
and | Fare + kee) = Key | Side, + ky | fd» (6.57) 


where in each case the existence of both integrals on the right implies 
the existence of the integral on the left. This is the analogue of theorem 
6.13. We have now removed the restriction k > 0 which was necessary 
in part (1i)(a) of that theorem since integrators had to be increasing. 
Theorem 6.14 takes the same form as before, but the analogue of 
theorem 6.15 does not have quite the appearance of the original. 
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Theorem 6.52. If g is of bounded variation on [a, b| and f € R(g; a, 5), 
then |f| € R (v,; a, b) and 


[ fas) < [ Iflavy 


Proof. By theorems 6.51 and 6.15, |,f| is integrable with respect to 
Vy» Pos Ig. Using also the relation v, = p,+q, and theorem 6.13 we 


cl 


if fade] < [ir dp, [s dq, 
< | iflde,+ [if ld = | flav, | 


b 
Corollary. | | fdg 


< sup |f@)| Vale). 


; b b 
Sometimes | fdv, is written | f\dg|. In this notation the con- 
a a 


clusion of theorem 6.52 has the form 


[i Fae| < [UF el 


Theorem 6.16 holds also for integrators of bounded variation. 
The new version of theorem 6.17 appears with a slightly relaxed 
hypothesis. 


Theorem 6.53. (Change of variable.) Let fe R (g; a, b), where g is of 
bounded variation. If ¢ is a continuous, strictly monotonic function 
such that ¢(a) = a and $(f) = b, then fo ¢ is integrable with respect 


togo¢g and b p 
| fas = | (fo #)atgo#), 


Proof. Let g =r-—s, where r, s are increasing functions and 
feR(r; a, b)n R(s; a, b). When @ increases the result follows 
immediately from theorem 6.17. When ¢ decreases (so that ? < a), 
then —(ro ¢) and —(so ¢) are increasing functions and the argument 
of theorem 6.17 shows that 


[fa = ['vona-o9), [sa = [ "(fo 9d -Go 9) 


Hence, using also (6.57) we have 


b oe B 
[fae = |" ro sa(-(g0 o = [fo dalgo 9). | 
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Corollary. The function ¢ of the theorem may be taken to be piecewise 
strictly monotonic when f € R(g; c, d), where | 
c= inf ¢(f), d= -sup (ft). 
a<t<B 


 axct<f 


Lastly we combine the analogues of theorems 6.24 and 6.25. 


Theorem 6.54. (Integration by parts.) If the functions f and g are of 
bounded variation on {a, b] and have no common left and no common 
right discontinuities, then f € R(g; a, b), g € R(f; a, b) and 


| S a | ; gdf = f(b)g(b) —f(a@s(@). (6.58) 


Proof. By theorem 6.44, the pairs of functions p; and p,, p; and q,, 
gq; and p,, qr and g, have no common left and no common right dis- 
continuities. It follows from theorem 6.24 that fe R(g; a, 5) and 
géR(f;a, b). Theorem 6.25 and simple algebra now yield (6.58). | 


Improper integrals with respect to integrators of bounded variation 
can be defined in the now familiar way, but the manipulation of these 
integrals may be awkward. The comparison principle does not hold 
for them (see exercise 6(e), 6); nor does absolute convergence imply 
convergence (see exercise 6(e), 7(11).) 


Exercises 6(e) 


1. Prove theorem 6.21 B with the restriction g(x) = 0 removed. (For the corres- 

ponding extension of theorem 6.21 A see theorem 6.85.) 

2. The function fis continuous on [a, b] and g is of bounded variation. Denoting 

by J a dissection 
am 2 25 Sey SS 


and by &; any point in [x;_,, x,], prove that 
n b 
Es Edtee)-eea} > | fae 
v= a 


as (D2) + 0. (See exercise 6(a), 9.) 
3. Let f, ~ f uniformly on [a, b] and let g be of bounded variation. Prove that. if, 
for n = 1,2,...,f,€ R(g; a, b), then fe R(g; a, b) and 
b b 
[ fede > | rae. 
: a a 
(See exercise 6(a), 13.) ; 


4. Show that the first mean value theorem (see exercise 6(a), 8) does not hold 
for the RS integral with an integrator of bounded variation. 


6.5] INTEGRATORS OF BOUNDED VARIATION 173 
5. Let f be a positive decreasing function on [a, ©) aces _ - (x) = 0. Also let 
g be bounded on [a, 0) and of bounded variation oe oes interval fi X] 


(X > a). Show that, if fe R(g; a, X) for every X > a, then iz fdg exists. 
a 
6. Let fon [7, ©) be defined by 


0 for (Qn—1)7 < x <.2n7, 


LONE a for 2n7 < x < (2n+1)7. 


Show that [ : d(cos x) exists, but that [ f(x) d(cos x) does not. 
T 1 


7. (i) Let the functions f, g on [a, 0) be such that, for every X > a,g is of 
bounded variation on [a, X] and fe R(g; a, X). Show that, if { | f| dv, exists, 
a 


then ‘g fdg exists. 


(ii) Prove that | C= d(cos x) exists, but that tS —— * d(cos x) does not. 


6.6. The Riesz representation theorem 


Every linear function from R” to R” may be represented by an 
nx m matrix (§3.2). This is one of a group of theorems on the general 
form of linear functions associated with various vector spaces. The 
most famous of these results is the Riesz representation theorem 
which characterizes certain linear functions on the space C [a, b] (of 
real functions continuous on the interval [a, b]). It fittingly illustrates 
the power and importance of the Riemann-Stieltjes integral. (It is 
not, however, subsequently used in this book and the proof, which is 
difficult, may be omitted without fear of repercussions.) 


Theorem 6.61. 
(i) To every continuous linear function A: C [a,b] -> R’ there 
corresponds a function g of bounded variation on [a, b\ such that, for 


every f € C [a, 5], b | 
ee | fis. (6.61) 


(ii) Conversely, given a function g of bounded variation on [a, 6}, 
(6.61) determines a continuous linear function A: C [a, b] > R’. 


The second part follows immediately from (6.56) and theorem 6.52, 
corollary. The proof of the first part is easy if one can use the fact 
that a bounded linear function on C [a, b] can be extended, without 
change of norm, to a linear function on B [a, b] (the space of bounded 
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real functions on [a, b]). Unfortunately we cannot prove this with the 
tools at our command and we have to content ourselves with a less 
interesting result which, nevertheless, is sufficient to establish the 
theorem. 

The sequence (/,,) of real functions on [a, b] is increasing if, for 
every x in [a,b], the sequence (f,(x)) increases; and uniformly 
bounded if there is a constant K such that |f,(x)| < Kfora<x<b 
and all n. We denote by C, [a, b] the vector space of all (bounded) 
real functions on [a, b] which are of the form 


lim f, —lim h,,, 


where (/,,) and (H,) are uniformly bounded, increasing sequences 
from C [a, 5]. 


Lemma. A real valued, continuous linear function on C [a, b|] may be 
extended, without change of norm, to the space C,, [a, 5}. 


Proof. Let A: C [a, b] > R} be a bounded linear function. 

(i) Consider a uniformly bounded, increasing sequence (f,,) from 
C [a, b]. The sequence converges and its limit function fis bounded. 

We begin by showing that (A(/,.)) converges. For k > 2, let 


_[ 1 if AD-AKD > 0, 


ee 
Then 


© AGA) 


= AW Ai-v} 


A {> CSS .-| 
S eff) fea} 


k=2 


SUS 


< [Al sup & {@) fia} 
= [All sup tin) AC} 
< [Al sup (£0) AG), 
i.e. the partial sums of >» IA) —A-»D| are bounded. It follows 


that the series 


ACA) + E, AUDA} 
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converges absolutely and so its sequence of partial sums, i.e. (A(/,)) 
converges. 

(ii) Next, let (f7‘) be any other uniformly bounded, increasing . 
sequence from C [a, b] which also has the limit f’ For n = 1, 2,..., 
s et ced 
Pn alae Pn = fn age 


Then (¢,,) and (¢%) are both strictly increasing sequences and clearly 


lim A(f,,) = lim AG@,), lim ACfx) = lim A>). 

Take an integer m,. There is an integer n, such that ¢,, (x) < $7,(%) 
for all x in [a, b]. To prove this let F;, be the closed set of points x 
for which ¢,,,(x) > $%(x). Since the F, form a contracting sequence, 
if none of them were empty, there would be a point € common to 
them all (exercise 3(f), 6) and we should have 

bm) > lim Px(E) = f(8). 


But this is impossible since (¢,,) strictly increases, and so there is an 
n, such that F,,, = @. Continuing in this way we obtain two sequences 
of integers (m,), (n;) such that 


ee i eh Pa 
By (i), the sequence 
AY)» AMPns)s ACP)» ACPng)s ++ 
converges and its limit is equal to both lim A(¢,) and lim A(¢}). 
—— lim A(f,) = lim AC). 


We can now, without ambiguity, define A(/) to be lim A({,). 

(iii) It is clear that, if f,h are limits of uniformly bounded, 
increasing sequences from C [a, b], then so are cf, where c is a non- 
negative constant, and f+. Moreover 


Af) = cAF), AGF+h) = AF) +A). 
We still have to define A(f—h). To do so we suppose that /*, h* 
are of the same type as fj / and that 


f-h = f*—h*. 

Since f+h* = f*+h, we have : 
Af) +A) = ACF*) +H), 
i.e. A(f)—ACh) = ACf*)-—A(H*). 
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Hence we may define A(f—h) unambiguously as A(f)—A(A). It 
follows immediately that A is linear on the vector space C, [a, 5]. 

(iv) Any member of C, [a, b] is of the form f—h, where f, h are 
the limits of uniformly bounded, increasing sequences (f,,), (1,,) from 
C [a, b]. To prove that the norm of the extension of A to C., [a, b] is 
still ||A|| we have to show that 


|AG-h)| < [Al 
where K = sup | f(x)—A(x)|. We define the functions 
Frere ee 
Fil) if [fr —An@)| < K, 
Fn®(X) = \ A(x)+K if fa(x)—Mn(x) > K, 
h,(x)-K if f,0)—-h,(x) < —-—K. 


by the equations 


Clearly 
[iGb= ideo eK (ax e.bp en =, 2.2). 


It is also easy to see that the functions f* are continuous and that 
f* +f. Finally, by considering the various forms that f*,,(x) —fn(x) 
may take we can show that f;(x) increases with n. (Exercise 6(/), 1.) 
Hence 


A(f-2)| = |lim ACf#)—lim AG,)| = lim [ACF —h,)| < ALK. | 


Proof of theorem 6.61(i). The functions h,,h,(a < x < b), hy 


— nye 0 for G4 F< bs 
py ail ing gt <=, 
; lo lot & < £=°0, 
Afi) =. | for a= <0 


all belong to C,, [a, b]. This is obvious for h, and h,; in the case of h, 

we need only remark that —h, = lim ¢,, where, for n > 1/(b—x), 

¢,(t) = —1 in [a, x], ¢,(¢t) = 0 in [x+(1/n), 5], and ¢, is linear in 

[x, x+(1/n)] (Gin the elementary sense—see note at foot of p. 66). 
Let g on [a, b] be the function such that 


g(x) = Ah) (aa: x = by, 


in particular, g(a) = 0. 
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If a dissection Z of [a, b] has the points of division 
@= 46 <4, 4k hee Sh = 


put e¢; = 1 or —1 according as g(t;)—g(t,;-.) > 0 or < 0. Then 


V(2, 8) = 3 edigtt) atid} = A{ S ehh} < (Alb 


= i= 


since S efhe (x) —he,_(29]] = 1 
i=1 


fora < x < b. Thus g is of bounded variation on [a, 5]. 

Now let f be any member of C [a, b] and take e > 0. Since / is 
integrable with respect to the variation functions p,, g, of g and is 
also uniformly continuous on [a, b], there is a dissection J of [a, D] 


i 
given by fa ee ee eX Xx, = O 


which has the following properties: 
b b 
5,5 p)-[ Faby <> S,L,4)-| faty <i (6.62) 
and M,-—m,; <= € (i — l, eeeg Nn), (6.63) 


where M,,m,; denote the supremum and infimum, respectively, of 
f in [x;-1, x;]. By (6.62), 


3 Meleex)-e(eD)- [fdr |< 26 (6.64 
while, by (6.63), the step function 
= & Mille, hy.,) 
is such that SUP | f(x) — ¢(x)| < € 
and therefore |A(f)-A($)| = |A-9)] < [Ale (6.65) 
But 


AG) = E MifAhz)—Mty,.)} = 3 Mileed-s%i-v} 


b 
wed co. 16 6a), A) — | fag rie 
Using also (6.65) we finally have 
b 
ac [fas] < (Al +2) 


which proves (6.61). | 
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Exercises 6(/) 
1. Prove that the sequence (/*), defined in part (iv) of the proof of the lemma, is 
increasing. 


2. A necessary and sufficient condition for the function ¢ of bounded variation 
on [a, 5] to be such that b 
(rap =o 
a 


for all fe C[a, b] is that d(b) = d(a) and g(x) = (a) when x is a point of 
continuity of ¢ in (a, b). 

Prove the necessity of this condition by considering the functions f, f,s 
defined as follows: 

(i) f(t) = 1 fora < t < bd; 

(ii) when x is a point of continuity of ¢ in (a, b) and 0 < 0 < b—x, f, s(t) = 1 
fora<t<x,f,5(t) = 0 for x+06 < ¢t < band f, g is linear in [x, x+ 4]. 

Prove the sufficiency of the condition by using exercise 6(e), 2. 


3. Let c be a fixed point in the interval (a, b) and define the linear function 
A: C [a, b] > R} by Af) = f(c). 


Enumerate all the functions g which have the property that 


ae : 
uA = | fie eC. 


6.7. The Riemann integral 

In the rest of this chapter we develop the theory of the Riemann 
integral. Our first task is to reconcile the definition of the integral 
given in C1 and the definition in terms of upper and lower integrals 
given in §6.1. The instrument for this is Darboux’s theorem, a proof 
of which was indicated in C1 (125). The reader is reminded that, 
given a dissection 9 of an interval [a, b], u(D), the mesh of J, is 
the length of the largest subinterval; and S(QZ, f), s(Z, f) denote the 
upper and lower Riemann sums of a bounded function / on [a, 5]. 


Theorem 6.71. (Darboux.) If the function f on [a, b] is bounded, then, 
as (2) - 0, 


fo b 
OsaN+(s NF 


Proof. Let Z be the dissection of [a, b] given by 
G@= Xo ty <1 Xe Se SO 


and let ZY’ be the dissection with additional points of division 
x, x®, ..., x, where 


Xp < KO < RO eee ee 
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Denote, as usual, by m, M, m;, M, the infima and suprema of f on 
[a, b] and on [x;_;, x;] respectively. Also let M%, M@, ..., M@t» be 
the suprema of f on the k+1 additional subintervals of Z’. We then 
have 


0< SF)-S) 
= M,(x;—X;-1) 
— {M(x — x, 1) +$ M@(x® — x) +... + MED x, — x®)} 
< Mix; —x;-1) —m(%;-— X41) < (M—m)(x;— x;-1). 


Hence, if Y* is any dissection obtained from ZY by adding new 
points of division, 


0 < S(Z)—S(B*) < (Mm) 2* (x;-X)-2), (6.71) 


where &* ranges over the intervals containing new points of division. 
Now, given e > 0, there is a Y, such that 


S(D_) < [7+ é. 


Let n, be the number of subintervals of DZ, and take 4d so that 
0.< 6 < e/ny. 


Let Z; be a dissection with “(G,;) < 6 and let Z* be the dissection 
with all the points of division of Z, and of Y;. Then 


S(D*) < S(F_) 
and, by (6.71), 


0 < S(D;)—S(G*) < (M-—m)nob < &(M-—m), 
since at most my points are added to Z; to form Y*. Therefore 
[7 < SD) < S(B*)+e(M—m) 
< S(D,)+e(M—m)< [fret +M-—m), 
1.€. [7 < S(F;) < [f+ e(1+M—m). 
This proves (i). The proof of (ii) is similar. | 


Theorem 6.71 does not extend unrestrictedly to the Riemann- 
Stieltjes integral; but the analogue is valid if the integrand or the 
integrator is continuous. (See exercises 6(a), 9; 6(g), 1 and 2.) 
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The next theorem establishes the equivalence of the two definitions 
of the Riemann integral. In it we use the standard notation 


Gx, < Xone ky ee 


for a dissection D of [a,b] and, for i = 1,...,n, we allow &; to 
range over all points of [x;_,, x;]. 


Theorem 6.72. 
(i) If f is integrable over [a, b], then 
n b 
EExd > [fas WF) 0. 


@ YF  THEE-—md>e as MD) +0, 


b 
then f is integrable over [a, b| and | j= ¢. 


Proof. 
(i) For every set of &,, 


(2) < ¥ fEIO- x) < SD) 


b b 
and, by theorem 6.71, s(Z) > | ff S@A- | f as “F) +0. 


(ii) Take any ¢ > 0. Given J, we choose, for each i, the points 


é,, & so that 
Fé) > M;-e and f(&) < m te. 


Then E MEN Xa) > S(B)—(b-<) 
and EME) < (D+ eb—a), 
so that zs 


EAD) -0-@) < 8D) < [Ff 


fb a 
< | f< 1B) < 3 MEM +O-4). 
Letting “(Z) > 0, we obtain 


b (ob 
o—e(b—a) < | f< | f< o+eb—a). | 


6.7] THE RIEMANN INTEGRAL 181 


Exercises 6(g) 
1. Show that the analogue of Darboux’s theorem holds for the RS integral with 
a continuous increasing integrator. 


2. Give an example of an increasing function g on an interval [a,b] and a 
function fe R(g; a, 6) such that lim SQ, f, g) does not exist. 
1(D)—>0 ‘ 


3. Let f be continuous on [a, 5]. Show that fis of bounded variation on [a, 5] if 


and only if lim V(Q,f) exists; and that, if the limit exists, then it is V2(/). 
p(Z)—0 


4. Give an example of a function f which is of bounded variation on [a, 5], but 


such that lim V(QY, f) does not exist. 
u(D)—0 


5. Deduce from theorem 6.71 that, ifa <b <c, 


fre pen fir moe (le fir fi 


6. The function f is continuous and strictly increasing on [a, b]. Show that, if 


f(a) = 4, f(b) = f, then ; ; 
[ir | ff = bB-aa. 


6.8. Content 


It is easy to see that a bounded function with only a finite number 
of discontinuities is Riemann integrable (see exercise 6(h), 1). We 
show in the present section that this remains true for suitably 
restricted infinite sets of discontinuities. The extension of our results 
to multiple integrals will play an essential part in chapter 8. 


Definition. Let X be a set. If E is any subset of X, the function 
Vegi X — R' given by 


1 E, 
Soe | for x€ 


0 forxeX-E 


is called the indicator (or characteristic) function of E. 

In this section we always take X to be R’. If, for instance, EF = [«, /}], 
( 1 a= x = f, 
Xia, (xX) = 
lo for x <:aor x > f. 


Now let [a, b] > [a, A]. Then 
[. Xta, pl) dx = B —. (6.81) 


b 
(In particular, { \¥z = 0 when E has just one point in [a, 5].) For, 
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ifa<a<f£<b and @ is a dissection of [a, b] containing «, 
as points of division, then 


: res 
p-a = s(D) <| Xa, fl < 3 Xa, AS S(Z) sa p-a+6+0, (6.82) 


where 6, 0’ are the lengths of the two subintervals of Z adjoining 
[a, 6]; and, ifa = aor Pf = BD, then (6.82) holds with 6 = O or 0’ = 0. 
It is proved similarly, or from (6.81), that 


b b b 
[ xen=[oxan=f xan =P (683) 


Also, if E is an arbitrary bounded set in R? and [a, b], [a’, b’] are any 
intervals containing EF, then, by the lemma preceding theorem 6.16, 


J 
{ xe = [. XE» [xe = [xe 


This consideration and the identities (6.81) and (6.83) lead to an 
extension of the notion of the length of an interval. 


Definition. Let E be any bounded set in R'. The outer Jordan content 


c(E) of E is the common value of |, Xx for all intervals [a, b| containing 


E; the inner Jordan content c(E) is the common value of [. Xx. If 


C(E) = C(E), this number is called the Jordan content of E and is 
denoted by c(E). 
If the set E has content, then xz is integrable over any interval 


b 
[a, b] containing F and | ‘Vn has the same value for all such intervals. 


We have shown that the content of a finite interval is its length. 
An example of a bounded set that does not have content is the set of 
rational points in a finite interval. 

Although we have defined content in general, in this section we are 
mainly interested in sets of zero content. 


Theorem 6.81. The bounded set E has zero content if and only if, 
given e > 0, there is a finite number of open intervals, or a finite 
number of closed intervals, which cover E and whose total length is less 
than €. 
Proof. Let E < [a, 5]. 
b 
(i) Suppose that c(£) = | Xn = 0. Then, givene > 0, thereisa D 
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such that S(Z, vz) < ¢; and S(Y, xz) is clearly the sum of the 
lengths of the closed intervals [x,;_,, x;] which contain points of E. 

Each [x;_1, x;] may be replaced by a slightly larger open interval 
so that the total length of the open intervals is still less than e. 

(ii) Suppose that, given e > 0, there are open intervals J,, ..., In 
covering E and of total length less than e. Since a pair of overlapping 
intervals may be replaced by a single open interval, we may assume 
that J,, ..., [,, are pairwise disjoint. Their end points (between a and 
b) determine a dissection J of [a, b]. Since [a, b]—(, U ... U Lm) con- 
tains no points of E, S(Z, Xz) < e. Hence c(£) = 0. 

If we had started with closed intervals, we could have enlarged 
them slightly into open intervals. | 


That every finite set has zero content followed already from (6.81). 
It is now easy to see that a bounded infinite set with no more than 
a finite number of limit points has zero content. (See exercise 6(h), 3.) 
For instance, if E = {1, 4, 4, ...}, c(E) = 0. 


Theorem 6.82. If f is bounded on [a, b| and the set E of points of dis- 
continuity of f has zero content, then f is integrable over [a, b]. 


Proof. Given ¢ > 0, there is a finite number of pairwise disjoint 

open intervals J,,..., [,,, of total length less than e, such that 
G = aU mace ULn — E. 
The function f is continuous on 
H = [a, b]—G = [a, |] nG’ 

and, since H is bounded and closed, fis uniformly continuous on H. 
Thus there is a 6 > 0 such that | f(x) —f(x’)| < ¢ whenever x, x’ ¢ H 
and |x—x'| < 6. 

Now let F be a dissection of [a, b] which contains all the end 
points (in [a, b]) of G,..., , and all of whose subintervals in H are 


of length less than 6. If the subintervals of D are [x;_, x;](i = 1, ..., 2) 
and M,, m,, M, m have their usual meaning, 


S(P)—-s(F) = x (M;,—m,) (x;— X;-1) 


= (g+2y) (M;—m,) (%;—X;-1), 
where X, and 2, run over the subintervals in G and H, respectively. 
Clearly 
Xq(M;—m,)(x%;— Xi) < Uq(M—m)(x,— X41) < (WM — mye 
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and 
Dy (Mj—m,;)(X%;— Xi) < DHee(Xj;— Xi) < e(b—a). 


Thus S(D)-s(Z) < (M-m+b-a)e. | 
Theorem 6.83. Suppose that f and g are bounded on |a, b| and that 


F(x) = g(x) except when x belongs to a set of zero content. Then either 
both f and g are integrable over [a, b] or neither is integrable; and, if f, 


g are integrable, : ; 
J:f> Joe 


Proof. All that is necessary is to show that h = f—g is integrable 
and [a = 0). 
Let E be the set of points £ such that h(€) + 0 and let M, m be the 
supremum and infimum of h/ on [a, b]. Then 
Mxx(x) < h(x) < Mxz(%) 


fora < x < band so, since c(E) = 0, 


b b b Tb b b 
o=m[ xe= [mes fs ['h< f Mxe-M[ xe = 0.| 


The theorem shows that, as far as integration is concerned, sets 
of zero content are immaterial. It is therefore legitimate, and some- 
times convenient, to leave a function undefined in a set of zero 
content. For instance 

Ae 
| sin — dx 
0 X 


has a definite meaning without the integrand being defined at x = 0: 
whatever value is assigned to the integrand at 0, the integral exists 
and its value remains unaltered. The next theorem similarly illustrates 
the utility of this feature of the Riemann integral. 


Theorem 6.84. Let the function f be continuous on the closed interval 
[a, b] and differentiable in the open interval (a, b). If f' (which need not 
be defined at a or b) is integrable over [a, b], then 


[fr =s0-fa. 


(This is a stronger result than theorem 7.63 of C1.) 
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Proof. Let Z be the dissection of [a, b] given by 
fs, <7, << 3 < X= P. 


By the mean value theorem, 
S(b)-f(@ = Py {f0a) -fOui-vi = 2 F'(Ei) Xi — Xia), 
where x,_, < & < x; i = 1,..., 7). Since f”’ is integrable, theorem 


b 
6.72(i) shows that the right-hand side tends to { f' as WZ) +0. | 


In the above proof the use of theorem 6.72(i) (and so of Darboux’s 
theorem) is convenient, but not essential. (See exercise 6(h), 10.) 

Theorem 6.84 means that, if f’ is integrable, then fis an indefinite 
integral of f’. It therefore shows that theorem 6.21 A is included in 
6.21B. Equally, the following convenient extension of theorem 6.21A 
is only a special case of exercise 6(e), 1. 


Theorem 6.85. Let f < R (a, b) and let g be continuous and of bounded 
variation on [a, b]. If g is differentiable in (a, b) and g' € R (a, 6), then 


fe R(g; a, b) and (rae : ez 


Exercises 6(/) 


1. The function f on [a, 5] is bounded and is continuous except at a finite 
number of points. Prove, without using the notion of content, that / is integrable 
over [a, 5]. 


2. Show that, if [«, 2] < [a, b], then 
b 
| Xa, p) = pa. 


a 


3. Show that a bounded infinite set with a finite number of limit points has zero 
content. 


4. Given any set E, let E™ be the set of limit points of E and, for n = + See 
define E™ to be the set of limit points of E”-». Show that, if E is bounded and 
E™ is finite, then c(E) = 0. 


5. Let E,, E, be subsets of R!. Prove that 
Xe, ye) = max (Xz), X20), Xz... 2,0) = min (Xz,0), Xz,0)). 
Assuming the identities 
max (a, b) = 4{(a+b)+ |a—b|}, min (a, b) = 4{(a+b)— |a—5]}, 
show that, if c(E,), c(E») exist, then so do c(E, U E,), c(E, nN E,) and 
c(E, U E,)+ c(E, 9 E,) = c(E,) + c(E,). 
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6. Prove that, if c(E,) = ... = c(E,) = 0, then c(E,U...U E,) = 0; and show 
that the relation c(E,) = 0(n = 1, 2, ...) does not imply that c(Z, U EU...) = 0. 


7. The functions f, g are bounded in [a, b] and f(x) = g(x) in [a, b] except for x 
in a set of zero content. Prove that 


Jr- Jie [ir- [i 


eee 


sin ———. 
aati 1—nx 


oS. Por + = 1.4. ¢. ... 


Show that fis integrable over [0, 1]. 


9. The function / on [a, 5] is given by 
0 if x is irrational, 
I/q if x = p/g, 


where p,q are coprime integers. By finding a sequence of integrable functions 
converging uniformly to f, or otherwise, prove that f is integrable over [a, 5]. 
(It was shown in §3.1 that f is discontinuous at all rational points of [a, 5]. 
The set of discontinuities is therefore not of zero content, which shows that the 
converse of theorem 6.82 is false.) 


f(x) = 


10. Prove theorem 6.84 without using any results from §6.7. 


11. The function f on [0, 1] is given by 
0 for’x = 0, 
xsin(logx) for0O<x <1. 


f(x) = 


Prove that fis differentiable in (0, 1], but not at 0, that f’ is integrable over 
[0, 1] and that 1 
I fat 
0 


12. Show that the function f on [0, 1] given by 


0 for x =U, 
f(x -| 


7 
x*sn— forO0<x <1, 
2x" 


is differentiable in [0, 1]. Show also that f’ is not integrable over [0, 1], but that 


1 
f’ exists and is 1. 
0+ 


13. The function f has a continuous derivative in the interval [a, 5]. Show that f 
is the difference of two increasing functions with continuous derivatives in [a, 5]. 
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6.9. Some manipulative theorems 


The first theorem of this section is the most useful version of the 
formula for integration by parts. 


Theorem 6.91. Let f,g be integrable over [a,b] and let F,G be 
indefinite integrals of f, g respectively, i.e. functions given by 


F(x) = [foa+x, G(x) = [-sar+z (a<x <b), 
where K, L are arbitrary constants. Then 
b b 
| fG +{ Fg = F(b)G(b)— F(@)G(a). (6.91) 


Proof. Since F,G are continuous and of bounded variation 
(exercises 6(a), 10; 6(d), 4), the result easily follows from the theorem 
on integration by parts for the RS integral (theorem 6.54) and from 
theorem 6.21B as extended in exercise 6(e), 1. However a direct 
proof is also desirable. 

We first note that the continuity of F and G ensures the existence 
of the two integrals in (6.91). 

For any dissection 9 of [a, b] given by 


a=2g =e =.= 5a <%=, 
we have 


= G(x;) a I (x)dx+ > F(X;-1) {* g(x)dx 
t=1 Li—1 i=1 Li—1 


E Ge Fe)- Fmd} + E Fei-dGO)- Ga} 


E (Fee) Ge) - Fi) 2} 

= F(b)G(b) — F(a)G(a). (6.92) 
We now show that : 

5 G(x,) | S fix)ax > { : G(xfidx as wD)->0. (6.93) 


The function G is uniformly continuous on [a, 5], i.e. given e > 0, 
there is a 6 > 0 such that 


|G(x) — G(x’)| < ¢ whenever x,x'e[a,b] and |x—x'| < 6. 
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Thus, when “(Y) < 6, 


J eZ acta y G(x;) ew (x) dx 


n Li { 


x | (Ge) Gad} FO) ae] 


1 


< ES] 16()-G(x)| /elax 
< Ef" elfe@lax = ef [scold 


This proves (6.93). Similarly 


3 Fa) [" seas [ Radgidr as (+0. 6.94) 


Letting “(Z) > 0 in (6.92) and using (6.93) and (6.94) we obtain 
(6.91). | 

The formula (6.91) with continuous fand g was proved in C1 (130) 
and has been used by us on previous occasions. The proof under these 
assumptions is very simple, but the strength of theorem 6.91 lies in 
the hypothesis of mere integrability. 


Theorem 6.92. (Change of variable.) Let f be integrable over [a, b] 
and suppose that ¢ on [«, £] satisfies the following conditions: 
(1) @ increases on [a, 2] and 


g(t) =a, $f) = b; 


(ii) $’ exists in and is integrable over [a, f). 


Then 
[r= [ vooe 


i.e. [ 709 a = [ ree" dt. 


Proof. When ¢ is strictly increasing this is easily deduced from 
theorems 6.17 and 6.21A. A proof of the theorem as stated, and 
independent of the Stieltjes integral, proceeds as follows. 

Let @ be the dissection of [a, 5] given by 


GS Ag Oa a Be ex, = B. 
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For each i, choose a ¢; such that d(7;) = x,. The ¢; are not necessarily 
unique (unless ¢ is strictly increasing), but we always have t;_, < t. 
For simplicity, take tp = «, t, = £. Note that, by theorem 6.84, 


ti 
ea = MMi) = [, #@a. 
As usual, let 


m= inf f(x)= inf Rico} 


Li—-1 SLL tj-1<t<t 
Me= sip 5 IG) = Sup i {A(t}. 
VM—-1R FSH 4-7 ft 


Then, since ¢’(t) > O fora <t< £, 


(9, f) = & mdxi-xea) = & [mpd 
n (ti B 
< 3 [' Ae@e'oa = | PEO}¥' War 


< | fo@eod = 3 |" sO} Oar 


<3 © MP Oat = & Mdxi-x.3) = MB.) 


ti-1 


b 
Now s(QZ, f) and S(QY, f) can be taken arbitrarily close to | f. Hence 


B b 
| (fo ¢)¢’ exists and is equal to | te} 


A similar theorem holds for a decreasing function ¢ and therefore 
also for a piecewise monotonic ¢, provided that fis integrable over 


fe dl,where 6s inf gf), d= sup o(. 
a<t<f 


a<t<f 


There are other sets of conditions for changing the variable. For 
instance, when / is continuous, ¢ need not be piecewise monotonic 
(see exercise 6(Z), 1). 


If f is integrable over [a, b], then there is a number A between the 
infimum and the supremum of / on [a, b] such that 


[7 = A(b-a). 


This rather obvious statement is usually called the first mean value 
theorem. However the second mean value theorem is more interesting. 
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Theorem 6.93. (Second mean value theorem.) Suppose that g is 
integrable over [a, b}. 
(i) If f is non-negative and decreases in [a, b], then there is a & in 


[a, b] such that . z 
[ % = f(a) [-s 


(11) If f is non-negative and increases in [a, b], then there is an n in 


[a, b] such that . ; 
[4-10 s. 


Proof. 
(i) Let K = sup |g(x)|, 
a<x<b 


so that g(x)+K > 0 for a < x < b. Then, for any dissection J 


Iven b 
e y d= 3g 6X <3 Se See 


[ f+% = z . fig+K) < E fe) | (g+K) 
a 4=1J xi-1 1=1 =1 
= Efe |" gtk 3 fedex 699) 


If ax) = [°s, 


then, since G(a) = 0, f decreases, and f(b) > 0, 


Xi 


oF (X;-1) & 


Li—1 


E SOD{G)- Ge v} 


E fade) E fed) 


E MO) GH)+/OEW 


IN 


E (fed -Se)} sup GO) +/0) GO) 
= {f@-S0)} sup, G3) +f) GO) 
Sahai 


Alo ft d@i-xd>[ fF as m0. 
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Hence, letting “(Z) > 0 in (6.95) we obtain 
[[Fe+0 <s@ sup Ga) +K F 
i.e. | : fg <sf@ a G(x). (6.96) 
Applying (6.96) with g replaced by —g we have 
[ (-2) < /@ sup, {Ge} = —fla) int G9, 


b 
so that | fe = f(a) inf G(X). (6.97) 
a axx<b 


Since G is continuous, (6.96) and (6.97) show that there is a § in 


[a, b] such that . z 
[ % -1@6® - 40 |’. 


(ii) This can be proved by an analogous argument or by applying 
(i) to f(b — x) and g(b—x). For then there is a £ in [0, b—a] such that 


[° “po-ns-aas = © [ e0-ae 
and theorem 6.92 (with decreasing ¢) gives 
[-fos@ae = £0 | eae 
where 7 = b—£. | 


Corollary. If g is integrable over {a, b] and f is monotonic in {a, 5}, 
then there is a & in [a, b| such that 


[2% = f(a) [ 2+fO [-e. 


Proof. Suppose that f decreases. Then f—/(d) is non-negative and 
decreasing so that, by the theorem, there is a & in [a, 5] such that 


[’ U-rons = (1-70) |’ 


i.e. ie = fa) ['¢+f0 [['e-f | = f(a) [s+1O ['s 


Note. Theorem 6.93 may be extended to RS integrals with con- 
tinuous integrators. (See exercise 6(i), 4.) 
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The second mean value theorem is often used to establish the 
existence or uniform convergence of improper integrals. The analogues 
for integrals of Dirichlet’s and Abel’s tests may be deduced from it 
(see exercises 6(i), 5,6), but the following example shows that a 
direct attack is simpler. 


Example. The integral 


| as (6.98) 
0 4 

Convergence of the integral. The integral converges for all values 
of y. To prove this we first remark that the integral is not improper 
at 0, since the integrand is continuous and bounded on (0, 00). Also 
we need only consider the case y + 0. Then, when ‘ > Ae D, 


{= XY 


1 
ds = Pal sin aydx| < YX rae | (6.99) 


and 2/(X,|y|) may be made arbitrarily small for all sufficiently 
large X;. The general principle of convergence (theorem 6.31) 
therefore shows that (6.98) exists. 

We had previously proved the existence of the integral (with y = 1) 
by using integration by parts. In fact both methods are frequently 
applicable. 

Uniform convergence of the integral. The integral converges uni- 
formly in any set of the form | y| > c > 0. For, by (6.99), 


sup /{ * : 
i oe 


ly|z>e =i C 


and we can now use the general principle of uniform convergence 
(theorem 6.34). 

The convergence of (6.98) is not uniform in any interval containing 
the point y = 0. Consider, for instance, the interval [0, 1]. Take any 
X 2 1 and let u = 7/(4X). Then 


2X sin XU 2X sin da 
E as > | = dx = 55 log 2. 
Therefore, for every X > 1, 
2X sin xy 
beet {; x om > ae 


and theorem 6.34 shows that the integral does not converge uniformly 
on [0, 1]. 
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Value of the integral. We prove that 
—in for y <Q, 
° sin 
[A ax = 0: for.» = 0, 
0 x 
in for y> 0. 


Given y, denote the value of the integral by, d()). It is clear that 
3(0) = 0 and ¢(—y) = —¢(y). Moreover, if y > 0, 


= Ge 
| EY ae = [SE dt > (0) as XY >, 
0 Xx 0 f 


and so ¢(y) = (1). Hence we need only show that ¢(1) = 47. 
Per n= 1, 2; .., put 


27 sin 2nx 


1 
: erie 

tes i} sin 2nx cot xdx, v, = : ax. 
0 0 


The following three steps establish the required identity: 
Gy uy, =. 47; (ii) lim (wu, —v,) = 0; (iii) lim v, = @(1). 
(i) is proved by induction. We have 


$n 
u, = i} 2 cos? xdx = 47; 
0 


4n 
Unii—Un = | 2 cos (2n+1)x cos xdx 
0 


4a 
= i} {cos (2n+2)x+cos 2nx}dx = 0. 
0 


(ii) Since x-1—cot x and —(x~*—cosec x) converge as x > 0+, 
by theorem 6.84 the former is an indefinite integral of the latter. 
Hence, by theorem 6.91, 


40 
V,—Un = | sin 2nx (5 —cot x] dx 
0 Xx 


cos 2nx /1 37 §=37™cos 2nx (1 
ee ee er. ee — [s—cosec? x ax 
2n = 0 0 2n x? 


>Qasn->o. 


(iii) follows from the identity 


nm sin t 
Un — | Aa ea dt. 
0 [ 
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Exercises 6(Z) 


1. Suppose that the functions f, ¢ satisfy the following conditions: 

(i) ¢’ exists in and is integrable over an interval [«, /], 

(ii) f is continuous on [c, d] where c, d are, respectively, the infimum and 
supremum of ¢ in [a, /]. 

Show that, if d(«) = a and ¢(/) = 5, then 


[ ee 2 (fo $y 


(In C1, 130, 6’ was taken to be continuous.) 


2. Frullani’s integral. The function f on (0, ©) is integrable over every interval 
[6, X](6 > 0) and f(x) > las x > 0+, f(x) > Las x > «. Show that, ifa, b > 0, 


then oe) 
od ccleal es ee ee log. 


O+ x 


3. Prove the following versions of the second mean value theorem and its 
corollary: 
Under the conditions of theorem 6.93 (1) there is a & in [a, b] such that 


i: “£29@® ['« 


Under the conditions of theorem 6.93 (ii) there is an 7 in [a, 6] such that 


[x = fo-) |e 


Under the conditions of theorem 6.93, corollary there is a € in [a, b] such that 
b 4 b 
[@=s0+) [2 + f6-) |e. 
a a 4 


4. Show that the second mean value theorem holds for RS integrals with con- 
tinuous integrators of bounded variation: if g is continuous and of bounded 
variation on [a, b] and if f is non-negative and decreasing on [a, 5], then there is 
a number & in [a, b] such that 


b E 
| fie= fo) [ dc = f@isO* or 


Give an example to show that the result becomes false if the condition that g is 
continuous is omitted. 


5. The functions f, g on [a, 0) are such that 
M4) 
(i) [ f exists and is bounded for x 2 a, 
a ’ 
(ii) g is monotonic on [a, 0) and'g(x) > 0 as x > ©. 
[o @) 


Prove that fg exists. 
a 
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6. The functions f, g on [a, ©) are such that 


(i) { f exists, 
a 
(ii) g is monotonic and bounded on [a, 0). 


foe) 
Prove that { fg exists. 
a 


7. Show that 
| sin (x*) dx 
1 


converges when |«| >1 and diverges when |a| <1. 


8. Show that all the following integrals converge: 


(a) { - sin x = (b) x sin x 


9 x+sinx 0 oe ee 


ie.) 2 fee) 
(c) { ricisee dx, (da) | cos x ees — dx. 
2 logx 0 x 


9. Show that 
eat 
S 
| in x zo 
0 x+y 
converges uniformly for y 2 0. 


10. Show that 
- BSS 
{ e-*” sin — dx 
0 y 


converges uniformly for y > 0. 


NOTES ON CHAPTER 6 


§6.1. Riemann’s theory of integration was published posthumously in 1867. He 
defined the integral as lim & f(&;)(x;— x;_1) (cf. theorem 6.72); upper and lower 
sums and integrals were introduced subsequently by Darboux. Stieltjes’s idea 
of integrating one function with respect to another was initially hidden in a long 
and now famous paper of 1894 on continued fractions. The importance of the 
integral was recognized only fifteen years later when F. Riesz used it in his 
representation theorem (6.61). 


[o9) 
§ 6.3. It is easy to see that, if fdg exists, then 
—-2o 


P.4 
lim fdg (*) 
xo 


[ee] c 
exists and is equal to = fdg. On the other hand, (*) may exist without [ tdg 
Cc 


existing (e.g. when f(x) = g(x) = x). In this case (*) is called the principal value 
of - fdg and is denoted by (P) iM dg. 
co 
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b 
Similarly, if f is unbounded near the point ué(a, db), (P) i fdg is defined as 
a 


u—h b 
lim (| +| ) fae 
h-0+ a uth 


b 
when this limit exists, but the improper integral { fdg does not. For instance the 


a 

1 4 

improper integral | , dx does not exist, but (P) | 2 dx does. 
a | = 

$6.4. The concept of a function of bounded variation is due to Jordan (1893) 

who used it for the characterization of curves of finite length. (See §8.1.) 


§6.6. Riesz’s theorem 6.61 has been followed by many similar theorems in- 
volving other types of spaces. For instance let / be the space of complex 
sequences x = (x,) such that 2 |x,| converges. Then every complex -valued 
linear function f on / is of the form 


f(x) = XenXn, 


where the sequence (c,,) is bounded. This representation theorem and others are 
given by Banach in his book Théorie des Opérations Linéaires (61-68). 


§6.8. It is pointed out in exercise 6(/), 9 that the converse of theorem 6.82 is 
false. However a modified form of the condition in the theorem is both necessary 
and sufficient for Riemann integrability. A set E in R‘ is said to have zero 
measure if, given ¢ > 0, there is a countable set of intervals (a, 5,) such that 


U(a,, bn) > E and X(b,—a,) < ¢. 


Theorem 6.81 shows that every set of zero content also has zero measure. On 
the other hand, a set of zero measure need not have zero content; the set of 
rational numbers in an interval is of this kind. The improved version of theorem 
6.82 is that a bounded function is Riemann integrable if and only if its points of 
discontinuity form a set of zero measure. 

It is curious that such a satisfying result on the Riemann integral should use a 
concept from Lebesgue integration. For measure, of which we have given the 
simplest example, is a generalization of content and it is related to Lebesgue 
integration in the same way as content is to Riemann integration. The essential 
advance which the idea of measure marks over that of content is that content is 
only finitely additive, whereas measure is completely additive. By this is meant that, 
if E,, ..., E, are pairwise disjoint sets with content, then 


ch, U3. UE = 2) +... tee 
(see exercises 6(h), 5, 6); but, if (E,) is a sequence of pairwise disjoint sets possess- 


ing measure, th 
eee mie = ee 


(where m(E) denotes the measure of E). 

The Riemann theory of integration is attractive, simple, and adequate for 
many purposes. However the Lebesgue theory, though more elaborate, provides 
not only a more powerful integral, but one which in many ways is also simpler to 
manipulate. Since 1902, when Lebesgue published his first results, a good many 
other integrals have been developed to solve various specific problems in analysis 
but none has that unique combination of properties which makes the Lebesgue 
integral pre-eminent. For a short account of the Lebesgue integral see J. C. 
‘Burkill, The Lebesgue Integral or R. G. Bartle, The Elements of Integration. 


7 
FUNCTIONS FROM R” TO R" 


7.1. Differentiation 


It is desirable to express a number of results in this chapter by 
means of matrices. We shall therefore conform to the practice in 
matrix algebra of regarding points in R” (vectors) as column matrices. 
If x ¢ R”, we denote the components of x by x, ..., X; and, to save 
space, we write x = (X,, ..., X»,)", where the superscript T indicates 
transposition. Similarly, a function f with values in R” is written as 
(fi, ---»f,)". The origin of any space R™ will be denoted by @. (It 
will always be clear from the context to which space the symbol 6 
refers.) Since subscripts are now used to identify coordinates, we 
shall often use superscripts to designate different vectors. For 
instance a sequence of vectors may be denoted by (x', x*, ...) or Ege 

Our aim is to define the process of differentiation for a function 
from R™ to R”. Before doing so we consider the particular cases 
m= tandn = |. 

Vector valued functions on (subsets of) R1 present no difficulty. 
If X is a subset of R! and € is an interior point of X, the function 
f = (fo ea fn)": X > R" is differentiable at € if 

im LEt DLO) 
0 h 
exists, i.e. if there is a vector f’(€) such that 


i! (E+h)—f(S)-f(E)h 
h 


>0O as h-O. 


Evidently f’(é) exists if and only if f4(8), ...,f;,(&) all exist, and 
F'(E) = £18), «so Fn(8))7- 


Note that f’ is again a function from R’ to R”. 

For functions from R” to R! the notion of a partial derivative 
is familiar. If we regard partial differentiation as differentiation along 
a line parallel to a coordinate axis, we are immediately led to the 
idea of differentiation in an arbitrary direction. 


Definition. Let & be an interior point of a subset X of R™ and let u 
be a unit vector in R™ (i.e. u = (Uy, «++ Um)" is such that 


jul] = J@i+...t+un) = 1). 
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Given the function f: X > R', if 
f(E+ tu) —f(é) 

t 


lim 

t—0 

exists, then this limit is called the derivative of fin the direction u and 
is denoted by D,,f(&). 

Clearly the partial derivatives of f are the derivatives in the 


directions 

lees (10, 2. 5S = Ot ee ae 
We shall generally use the expressions D,f, Df, ..., D,,f for them, 
since the classical notation for partial derivatives can lead to con- 
fusion. 

In spite of possible first appearances, the directional derivative is 
not the counterpart of the derivative of real functions. For instance 
the existence of all directional derivatives at a point does not ensure 
continuity at that point. For let the function /: R? > R* be given by 

Se 
f(x, y) = yet" 
0 when (x, y)? = (0, 
Then, if uw2+v? = 1 and ¢ + 0, 


t 


when (x, y)? + (0, 0)”, 


u*t? + vy?’ 
[ v/v when v + 0, 
so that Du, wf, 9) = | : : 
Ww v = 0. 


However /(x, x”) = 4 for all x + 0 and so f/ is not continuous at 
(0, 0)”. 

In C1 (156) the property of differentiability of a function from R? 
to R! was defined. The definition is easily extended to functions from 
R™ to R'. If X is a subset of R™ and € = (€,, ..., &,,,)7 is an interior 
point of X, the function f: X > R} is said to be differentiable at & 
if there are constants a), ..., a,, such that 


flSrt My v9 bm Mm) =P (Bay +09 Sm) = (Galy + oF Omltm) 07.11) 
J (hi +... +h.) 7 


as /(h?+...+h%,) > 0. The motive for this definition is that, if f 
is . differentiable :atf.= (£;,...05:6,,)'..0nd. 1 7665 aoe 
then the surface x,,41 = f(X1, -.-» Xm) in R™*" has the tangent plane 


Xmar—S Ess «+> mn) — Gal — £1) — «+» — On Xm — En) = 0 
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at the point (&,, ...5 in» §m+1)7- (Here we have used the terms surface 
and tangent plane in an intuitive sense; surfaces are not, in fact, 
discussed in this book.) 

We have previously noted (see p. 40) that every linear function 
A: R” > R' is given by a formula 

a = et, £0, (C= i ee Ee) 

where Cj, ...,CGm are constants. Hence the condition (7.11) for 
differentiability can be put in the form: 


S(E+h) = +0 as || ss 
for a suitable linear function A on R” to R'. The definition of differ- 
entiability of a function from R” to R” now follows quite naturally. 


Definition. Let X be a subset of R™ and & an interior point of X. The 
function f: X > R" is said to be differentiable at & if there exists a linear 
function A:.R™ — R” such that 


LEED AON! 0 as hl >0 (7.12) 


(where the norms in numerator and denominator are those in R” and 
R” respectively); the linear function 4 is called the linear derivative of 
fat & and is denoted by Df(&). 

To justify calling Df(x) the linear derivative we need the following 
uniqueness theorem. 


Theorem 7.11. If f:X — R" (where X < R”) is differentiable at &, 
then the linear derivative of f at § is unique. 


Proof. Let A! and A? both be linear derivatives of fat ¢ and put 
Ai —A® = pw. Then, fori = 1, 2, 


IAE+A) TH — A‘(h)|| +0 as |h] +0 


and therefore, by the triangle inequality, 


AO) ae ve 
Tal ~O as |All +0. (7.13) 


Now take any xe R™. Since A! and A? are linear, so is ~ and, by 


713), 
to 
[=| 


|“@2)|| = [||] lim 
Thus A! = A. | 
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We now add a number of comments on the definition of differenti- 
ability. 
(a) It is sometimes convenient to write (7.12) in the form 


IFE+A)-f(E)-AM]|] = ori) as [All +0. 
(b) A linear function A: R! > R” is such that 
A(x) = ax (xe R}) 


for some vector a; and there is a bijection between the A’s and the 
a’s. If f = (f,, ..., f,)7 1s a differentiable function from R to R”, this 
bijection provides the connexion between the linear derivative Df(£) 
of f at £ and the derivative f’(£) = (/1(6), ..., f,())*; for Df(é) is the 
linear function on R! to R” determined by 


Df(E)(x) =F'(E)x = (FilS)% ---Fn(S)X)* (x € R?). 


We shall see later (theorem 7.13) that, when f = (f,, ...,f,)7 is a 
differentiable function from R™” to R”, there is a corresponding 
connexion between Df(&) and the partial derivatives of fi, ..., f,, at &. 

(c) We have stressed that, when /:X ~ R” (where X < R”) is 
differentiable at £, then Df(é) is a linear function on R” to R”. On 
the other hand Df is a function from R”™ to L(R”™, R”), the space of 
linear functions on R” to R®” (see §3.2). The domain of Df is the 
subset of X on which fis differentiable. 

(d) If the function /: R” > R” is constant, then clearly for every 
Ee R™, Df(&) is the zero linear function ©: R” +R” (defined by 
©(x) = 6 for all x € R”). 

(e) Let /: R” > R®” be a linear function. The identity 


K(E+h)-f(E) f(A) = 4 


shows that, at every point € of R”, fis differentiable and Df(é) = f. 
Note that this is an entirely different relation from 


(djidx) exp x = exp x (ver?) 


which is described by the equation g’ = g. 

(f) Terms other than linear derivative which are used for Df(é) 
are derivative (or total derivative) and differential, but these suffer 
from the disadvantage of having other well established meanings. 

(g) The relation (7.12) can be used to define the property of 
differentiability and the linear derivative of a function whose domain 
and range lie in arbitrary normed vector spaces. 
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It is easy to prove that differentiability implies continuity and the 
existence of partial derivatives. 


Theorem 7.12. If X < R™ and the function f: X > R" is differentiable 
at the (interior) point & of X, then f is continuous at &. 


Proof. Denote the norm of the linear function Df(é) by k (see 
eee | DAE) < lx 
for all x ¢ R™. By the definition of differentiability, there is a d > 0 
such that E+) -/O- DOI < Lal 


when ||i|| < 6. Hence 


If7E+H FO] < |DAHMM| +] < &+ DAI 
for ||h|| < 6. | 


Theorem 7.13. Let the function f = (fis -.->fn)7: X > R” (X © R”) 
be differentiable at the point & of X. Then all the partial derivatives 
Df) @ = 1, ....m3j = 1,....m) exist and the linear derivative 
Df(&) is given by 
Dy fi(8)---DmA(8) \ [%1 
Df(E)(X) = [ccc cc ccc ccc ceee : (x = (X, ..:5 Xm)7 € R”). 
Proof. The linear function Df(é):R” — R” is determined (see 
p. 40) by an equation of the form 
Gan tn: 
Die 4 sixes. 6h a Ke “© RB"). 


QAni- ° Anm Xm 


Now AE +h) -£(E) — DFE) (A)II/||4]| > 0 
as ||h|| +0. Considering the jth component in the numerator and 
taking h = te’, we have 
| FE + te*) —f(E) — ayet| [|t] > 0 
as t > 0. Thus D, f,(&) exists and is equal to a;;. | 
Note. Ifm = 1, i.e. if f = (fj, ...,f;,)” is a function from R' to R”, 


the matrix of Df(é) is the column matrix (/;(£), ...,f(8))” which has 
previously been defined as f’(£). 
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Ifn = 1, ie. if fis a function from R™ to R!, the matrix of Df(§) 
is the row matrix (D, f(&), ..., Di, f(§)). 
When m = n, the determinant 


Dy fi(S)--» Dn fil) 


Ds IA $)s Da Fulb) 


associated with the linear derivative Df(£) is called the Jacobian of f 
at € and is denoted by Jf(&). The classical notation is 


(hs eee iP) 


(S31, aa | En) : 


The fact that the linear function Df(£):R” > R” is bijective if and 
only if Jf(€) + 0 (see p. 40) is crucial for the implicit function 
theorem (7.43) and related results. The geometrical significance of the 
Jacobian is that it measures local magnification: if fis a function 
from R” to R” and E is a set none of whose points is far from &, 
then the volume of f(£) is approximately |Jf()| times the volume 
of E. This statement and the ideas it involves will be made precise 
in § 8.6. 

We have previously given an example of a discontinuous function 
whose partial derivatives exist. The converse of theorem 7.13 is 
therefore false. Differentiability, however, is ensured if the partial 
derivatives not only exist, but are also continuous. 


Lemma. The function f = (fi, ...,fn)7:X > R” (X © R”) is differ- 
entiable at the (interior) point & if and only if each component function 
f;:X > R' is differentiable at &. 


Proof. Let A be the linear function on R™ to R” determined by the 
matrix 


and let A; (j = 1, ..., 7) be the linear function on R” to R! given by 
the row matrix 
(ans «55 Ope)- 

To prove the lemma we need only remark that 


IFE+A)—f(E)—AM)|| = (|All) as [Al +0 
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if and only if, for 7 = 1, ..., n, | 
Fil +h FE) -AA| = ofA) as [All +0. | 


Theorem 7.14. Let X be a subset of R™ and let & be an interior point 
of X. If the function f = (fi; ....f,)7: X — R" is such that each partial 
derivative D,f; G = 1,....m3j = 1, .., 0) exists in an open ball 
B(é; 6) and is continuous at &, then f is differentiable at &. 


Proof. In view of the lemma it is sufficient to prove that each 
function f;: X > R'is differentiable at ¢. 
ih = hy. fe aE =O and 


kt = (h,, .%,.4g0,.5 0) =he'+:..the (= 1,...,m), 
sothat f(E+M-HO = THEM -HE+K 9}. (7.14) 


For each i, ki—k*-! = h,e* and so, by the mean value theorem, there 
is a number a, between 0 and | such that 


F(EtK)—-f(Etk >) = hD, f(Etk *+a,h,e). (7.15) 
Since the functions D, f; are continuous at € and 


. JA 4 +0,h,e*| < |k| < |Al, 
it follows that 

h,Df(E+k*1+a;h,e) = h{Df(§)+o0)} = fh, D; F() + o( Al) 
as ||h|| > 0. Hence, by (7.14) and (7.15), 


f(E+N-HO- YADA = o(lal)- | 


Exercises 7(a) 
1. Let X be a subset of R” and let £ be an interior point of X. Show that, if the 
function f = (f,, ..., fn)": X >R” is such that all partial derivatives D, f; exist 
and are bounded in an open ball B(g; 6), then fis continuous at &. 


2. Let & be an interior point of the subset X of R”. Show that, if the function 
f: X ~ R* possesses the directional derivative D, f(&), then D_,f(&) exists and 
is equal to — D,f(&). | 


3. The function f: X > R'(X © R”) is differentiable at the (interior) point 
E of X. Prove that every directional derivative D,f(&) exists and that 


Di f(S) = Df). 
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4, If the function f: X > R(X © R”) is differentiable at the interior point & of 
X, the gradient Vf(&) of fat § is defined as the matrix of Df(§), i.e. 


VA(S) = (Dif), «.-, Dn f(E))- 
Show that, if V/(é) + (0, ..., 0), then 


ae (Vf(E))" 
° Ivf 
is the unique vector such that ||v,| = 1 and 


| DFE)|| = sup |Duf(®)| = |D., fl. 
5. The function f: R? ~ R! is given by 
xy 
Hae fad when (x, y)? + (0, 0)7, 
0 when (x, y)? = (0, 0)?. 


Prove that at (0, 0)? f is continuous and possesses all directional derivatives, but 
is not differentiable. 


6. The function f: R? > R'is defined by f(x, y) = |xy| (x, y)? € R?). Prove that 
fis differentiable at 9 = (0, 0)7, but that there is no disc B(@; 0) at all points of 
which D,fand D,/ exist. 


7. Thefunctions¢d = (¢,,..., ¢,)7 andy = (h, ..., Wr)7 havea common domain 
X < R™ and ranges in R”, R” respectively. The function f: X — R”*? is defined 


= f= Chex. bats AO ee 


and we abbreviate the right-hand side to (¢(x), (x))7. Prove that fis differenti- 
able at the point § ¢ X if and only if ¢, % are differentiable at € and that, when 
there is differentiability, Df(€) is given by 


Df(é)(h) = (DEE), DYE)" (he R”). 


8. The real functions ¢, yy on R! are differentiable at the points a, b respectively. 
Prove that the function f: R? > R' given by f(x, y) = (x) 0) (, y)* € R®) is 
differentiable at (a, b)”. 


9. Find a function f: R? > R' which is everywhere continuous and nowhere 
differentiable. 


10. The function f: R? > R? is defined by 
f(x, y) = (sin x cos y, sin x sin y, cos x)? ((x, y)? € R®). 
Show that fis everywhere differentiable and that, for all (x,y)? € R’, 
| Df(x, y)|| = 1. 


7.2. Operations on differentiable functions 


Given two functions f, g from R” to R”, we may form their sum 
f+g (a function from R™ to. R") and their inner product f.g (a 
function from R” to R!). We also consider scalar multiplication of a 
function from R” to R” by a function from R™ to R}. 
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Theorem 7.21. 

(i) Suppose that the functions f: X > R" and g:X > R" (X © R”) 

are differentiable at . Then f+ g and f.g are differentiable at § and 
D(f+g)(S) = Df(S) + Dats), 
D(f.g)(€) = f(§). Da(s) +.8(£). Df(é). 

(ii) If the functions $¢:X > R' and f:X > R" (X < R”) are 
differentiable at &, then $f is differentiable at & and the linear function 
D(¢f)(&) from R™ to R” is given by 

DEO) = GOHIDAMOMI+(DEOHOCMSS) eR”). 


Proof. 
(i) The proof of the first statement may be left to the reader. To 
prove the second we write 


(f.g)(§ +h) -(7.8)()—-/E) [Ds 1+) LAMB 
= {f(§+h)—f(8) — DAS) Mj .8(E +A) 
+ {g(& +h) —g(6)— Dg(S)(A)}. FE) 
+[Df() (A). {g(é + 4) —g(6)} 
=prqrr, 
say. By Cauchy’s inequality (theorem 2.21), |g| = o(|Al|) as ||h|| + 0. 
Using also the continuity of g at £ we have |p| = o((||Al|) and 
I7| < | DFS) |All le(E+A)—-gs(|| = (|All). 


(ii) The proof is similar to the one we have just given. | 


We now turn to the composition of two differentiable functions. 


Theorem 7.22. Let X, Y be subsets of R™ and R" respectively. If the 
function f = (fi, ...sfn)4:X > Y is differentiable at the point € and 
& = (24, ..-5 By)": Y > R® is differentiable at the point f(€) = 4, then 
the composite function go f:X — R? is differentiable at § and 


D(gof)() = Dg(y)o Df(&), 
so that the matrix of D(gof)(&) is equal to the matrix product 


( vee ar ( vee “| 


e@eeeeeekeseeeeeeseesekeeeio fF fF ee ee @# @# @# # © ® © © © @ @ 
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Proof. Put Df(§) = A, Dg(7) = », so that, if 


FE +h) —-f(S)- AM) = gh) 


and 2(y +k) —29(9)—-A) = rk), 
then l|a(h)|| = o(|h]) and |r(k)| = o([k\). 
Let (gof)(§+h)—(gof)(§)—(0A)(A) = sf). 


We have to show that ||s(h)|| = o(||A\|). 
Since / is linear, 


s(h) = g(f(E+h))-e((E) — HE +4) -/(5)) 
+ e(f(é +h) —f(S) —A)) 
= r(f(E +A) -f(8) + Hh). 
We saw in the proof of theorem 7.12 that, when |/A|| is small, 
IFE+A)—FE) < (AL+ DAI. 


Since also ||r(k)|| = o(|k|]) and |7(@)| = 0, it follows that, given 
e > QO, 


IrfE+h—-f)| < el fE+A-/OI| < (Al + DA (7.21) 
for all sufficiently small ||/||. Thus 


Ir f(E+A) —f/(E)) || = o(Al). 
(Notice that in (7.21) we cannot write 
I7¢E+A) SE) = 0 FE+ —SE))- 
since || f(é +A) —f(&)|| may be 0.) Moreover 
lM) < ll lal = (ll). 


This proves that ||s()|| = o((|Al|). 

The last clause of the theorem now follows from the fact that the 
matrix of the composition of two linear functions is the product of 
their matrices. | 


Corollary. Let X be a subset of R® and suppose that the function 
f:X > R” is injective. If f is differentiable at the point € of X and f~ 
is differentiable at the point y = f(§), then Df(§): R” > R" is bijective 


ee | Df-) = [D/O 
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so that, if f;* denotes the i-th component of f—, 


Dy f°(0)--- Dn fi) D, fi(S)..» Dr fi(S)\ ~ 


Dy fi)» Da fn) Dy fnlS)-+- Dn Fal) 


Using the functions f and g of theorem 7 os put 


gof= p a (9, ne eg 7 
Then the (j, i)th element D;¢,(€) in the matrix of D¢(£) is given by 


D;$(S) = os Dz8(9) Di fi(§)- (7.22) 


The formula (7.22) is the chain rule. Now denote points in R” and 
R” by (x4, 2+) Xm)” and ()4, ..-, Yn)” respectively. Then the classical 
form for (7.22) is 0b, gy AN 


oy. in ou, Oe” 


where @y,/0x; and 0¢,/0x, are evaluated at £ and dg,/éy, is evaluated 
at y. The chain rule was given in this form in Cl (158). Theorem 
_ 7,22 expresses the rule concisely and explains its origin. 

Note. When m = 1, so that f and ¢ = go/f are vector valued 
functions on R}, then 


91(8) 


Dyg3(9)...Dngim\ (F108) 


00-006 6876.8 8 8 6. & 2 6 9 


$,(§) D,g(9)---Dn&o(M/ \Fn(S) 
i.e. $'(€) = Dg(n)(f'()). 


When m = n = 1, so that fis a real function, and g is a vector 
valued function on R}, then 


$1(8) £1 (7) 2i(1)f'(S) 
= : 1) = : ; 
$(§) gp(7) gis’) 
ive. ¢'(é) = f'(E)a'F()). 


If also p = 1, we have the formula, proved in Cl (68), for differenti- 
ating the composition of two real functions. 
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Exercises 7 (0) 
1. Given the function f: X > R" (X © R”), the function || /|| : X > R? is defined 


: Ifl@ = [foo] eR”). 


Prove that, if f is differentiable at the point £ € X, then so is |/f||, provided that 
ACS) ee 


2. Show that, if the function /: X > R" (X © R”) is differentiable at the point 
€ & X, then so is the function ||f|?: X > R'. Prove also that, if f is such that 


f|| is constant, then f@). D/O) = 0 
for all x € R™. 


3. Let X be a subset of R” such that tx €e X whenever x € X and ¢ > 0. The 
function g: X > R? is said to be homogeneous of degree « (where «& is real) if 


e(tx) = t%g(x) 
for allxe X and t > 0. 
Let X be open and let g be differentiable at all points of X. 
(i) Show that, if g is homogeneous of degree a, then each function D;g 
= (D;g1, ..., Dig,)* : X > R® is homogeneous of degree «—1. 
(ii) Prove that a necessary and sufficient condition for g to be homogeneous 


of degree a is that 
: Dg(x)(x) = a(x) 


for all x € X. 
4. The function f: R?— {0} > R? is given by 
(x, y)* 
xy=— 
ee I, »)||* 


(a real). Use the polar transformation p: R? > R?® defined by 
(x, y)* = p(r, 6) = (rcos g, rsing)* ((r, 6)? € R’) 
to prove that Jf(x, y) = (1—«)||@, y)|| -**. 
5. The function f = (/, fo, fs)" : R? > R? is given by 
File) = 2X4 —- XX, fo) = XX — HX X0Xg, f(x) = X1X9Xz. 


Prove that f is injective in X = R®—{x|x,x, = 0} and find Y = f(X). Show also 
that f-!: Y > R° is differentiable and evaluate J/—1(y) for all ye Y. 


7.3. Some properties of differentiable functions 
In the study of real functions the mean value theorem is a powerful 


tool. Its analogue fv) fl) = Df(O(w—u) 


(for suitable £) holds for functions ffrom R” to R} (see exercise 7(c), 1). 
The corresponding result for vector valued functions is false (see 
exercise 7(c), 2), but the theorem below is a partial substitute. In it 
we use the notion of convexity. A set S in R™ is convex if, whenever 
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two points belong to S, then so does the segment joining them. An 
interval, an open ball, or a line segment provide examples of convex 
sets. 


Theorem 7.31. Let X be a subset of R™ and let S be a convex subset 
of X. If the function f: X > R" is differentiable at all points of S and 
|| Df(x)|| is bounded for x € S, then, for all u,v € S, 


\F0«)—FC)| < (Sup |. DFCO|) |u—2l. 7.31) 


Proof. Take any points u, v in S and put v—u = A. If f(u) = f(r), 
(7.31) clearly holds. Otherwise let w be the unit vector in R” such that 


IOS] = 0 FO) -FU)}. (7.32) 
Now define the function ¢:[0, 1] ~ R! by the relation 
Q(t) = w.f(ut+th). 


Then ¢(0) = w.f(u), (1) = w.f(v) and ¢ is clearly continuous on 
[0,1]. Also ¢ is differentiable in (0,1) and (see note following 


theorem 7.22) (0) = 0. Dflu+th@). 
By the mean value theorem, there is a 7 in (0, 1) such that 
e(1)—-9(0) = (7), 
ie: o {f(v)—f(w} = wo. Df(u+ th) (h). 
Then, by (7.32), 
|f@)-f@)| = ©. Dftutth)(A) < | Dftu+th)|| |All 
and, since u+7h eS, this implies (7.31). | 


Under certain conditions there may also be a lower estimate for 
| f(u) —f(v)||. We need only a simple result of this kind. 


Theorem 7.32. Let X be a subset of R” and let the function f: X > R” 

be differentiable at the point &. If Jf(€) + 0, then there is ad > O such 

that 1 

+h)- > ao Cd 433 

AE +)-SOl > araR—HH IA (7.33) 
as 


whenever ||h|| < 6. 


Proof. We have 
S(E+hM)—-f(E) = DAE) +r), 
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where ||r(A)||/||4|| > 0 as ||h|| + 0. Then 


IME+M—/()| = DAH M| - |r| 
] 
2 [p> "I —|r(A)||, 
2) = |DAD-UDA/OAI| < | DAE) [DAO MI: 


and we choose 6 so that 


"| < anna [A 
for ||h|| < 6. | 


An immediate consequence of the mean value theorem is the fact 
that a real function whose derivative is zero in an interval is constant 
in that interval. Theorem 7.31 enables us to prove a similar theorem 
for functions from R” to R”. 


Theorem 7.33. Suppose that the set G in R™ is open and connected. 
If the differentiable function f:G — R” is such that, for every €€G, 
Df(&) is the zero linear function © (see p. 200), then f is a constant 
Junction. 


Proof. Let u be a fixed point of G and let x be any other point of G. 
By theorem 3.32, u and x may be connected by a polygonal arc 
lying entirely in G. Denote the vertices of this arc by 


ee ee SS eS 


Since the straight line segment joining x‘-! to x‘ is a convex subset of 
G, it follows from theorem 7.31 that f(x") = f(x’). This holds for 
i = 1,..., p and therefore f(x) = f(u). | 


In the theorems of the next section (and in others) the functions 
under consideration will not only have to be differentiable; their 
linear derivatives will also be required to be continuous. The sense 
in which continuity is to be understood is entirely natural. Let the 
function f: X > R" (X < R®”) be differentiable in the subset X, of X. 
If §¢X,, Df(é) is a linear function on R”™ to R", ie. Df(S)eE 
L(R™, R”); and, as we have previously remarked, the linear derivative 
Df is a function with domain X, and range in L (R”, R”). The set X, 
is equipped with the metric induced by R™ and the set L (R”, R”) 
has a natural norm (see (3.22), p. 41) which produces a metric. 


7.3] PROPERTIES OF DIFFERENTIABLE FUNCTIONS 211 


Therefore the usual notion of continuity applies to the functions on 
X, to L (R™, R"), of which Df is one. If all points of X are interior 
points, i.e. if XY is open, then X, may coincide with X and we are led 
to the following definition. 


Definition. A function f:G > R", where G is an open subset of R™, 


is said to be continuously differentiable if Df(é) exists for all §€G 
and if the function Df:G > L (R™, R") is continuous on G. 


Let (X, p) be an arbitrary metric space. A function 
A: X > L(R™, R”) 


may be represented by a matrix 


eoeeee?#e@ 


Ani ee -Anm 


each element a,; of which is a function on XY to R*. The inequalities 
(theorem 3.25) 


max Jay(x)—a()| < [AC ABI < Clana 


show that A is continuous at the point § € X if and only if each of the 
functions a,;; is continuous at . Combining this result with theorems 
7.13 and 7.14 we obtain a simple criterion for a function to be con- 
tinuously differentiable. 


Theorem 7.34. Let G be an open subset of R™. The function 
fa for 


is continuously differentiable if and only if each of the partial derivatives 
Di ffi = leg MGS = By cs n) exists and is continuous on G. 


It is evident from the definition of continuous differentiability (or 
from theorem 7.34) that, for real functions, the property is equivalent 
to the possession of a continuous derivative. 


Exercises 7(c) 
1. Let X be a subset of R” which contains the points u, v and the segment 
o = {x|x = utt(v—u);0 < t < 1}. 


The function f: X > R! is continuous at u, v and differentiable at all points of o. 
Show that there is a point £ of o such that 


f(ve)-—f(@) = Df) (v—y). 
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2. The function f: R! > R? is given by 
Ti) = Gos XSi): 
Prove that there are points u, v € R! such that 
fvr)-f@) + Df) w-4) 
for all € € R’. 
3. Prove that, under the conditions of theorem 7.31, 
|f@) -—f()— Dfww—-u)| < (sup | Df) — Df) le—ul 
LE 

for allu,veS. 
4, The set Gin R” is open and the function f/f: G > R” is continuously differ- 
entiable. Show that, if K is a compact subset of G, then 

(i) | f+h)—FCO)| +0, 

(ii) || f(e+h)—f(X)— Df) I/||A| > 0 


as ||h|| +0, uniformly for x € K. 


5. Let G be an open, connected subset of R” and let (f”) be a sequence of con- 
tinuously differentiable functions on G to R”. Suppose that (f”(x)) converges for 
at least one x € G and that (Df”) converges uniformly on every compact subset 
of G. Show that 

(i) (f”) converges uniformly on every compact subset of G; 

(ii) f = lim f’ is continuously differentiable in G and, for all x € G, 


| Df(x)-=-lim Df "(x). 


7.4. The implicit function theorem 

As an application of Banach’s fixed point principle we obtained 
in chapter 3 a theorem on the existence of implicitly defined functions. 
We begin by proving a variant of that theorem in which the hypo- 
thesis is a little more stringent and the conclusion is correspondingly 
strengthened. 


Theorem 7.41, Let (&, 4)" be an interior point of a set E in R* and 
suppose that the function f: E > R' satisfies the following conditions: 


(i) f(E, 7) = 9; 


(ii) fis continuously differentiable in an open set G containing (&, 7)* ; 
(ii) Daf(& 7) + 0. 
Then there exist a rectangle 
MxN = [-a, §+a]x[y-/, 7+] 
and a continuous function ¢:M +N such that y = ¢(x) is the only 
solution lying in Mx WN of the equation f(x, y) = 0; moreover ¢ is 
continuously differentiable in M° (the interior of M) and 
$'(x) = —Dif(x, $())/Dof(x, $x) 
for §-a<x < +a. 
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Proof. Theorem 3.47 gives a unique solution ¢, but only guarantees 
its continuity. To prove continuous differentiability we first remark 
that M and N may be taken so small that Mx WN ¢ G and, for 
(x,y)? €MxN, Dof(x, y) + 0. Next, take a point x in the open 
interval (—a, +c) and let h be any non-zero number such that 
xthe[E—a, +a]. Then 


{fxth, P(x+h)) fx, 6% +M)} +{F& Gx +A) SE, SON) = 0 


and so, by the mean value theorem, 
AD, f(u, (x +h)) + {P(x +h) — $(x)} Daf(x, v) = 9, 


where u lies between x and x+h, while v lies between ¢(x) and 
¢é(x+h). Thus, since D, f(x, v) + 9, 


d(xt+h)—(x) _ _ Di fu, G(x +h) 
h Dele, oo 


It now follows from the continuity of the functions ¢, D,fand Df 
that, first, d’(x) exists and is equal to 


tv Dif (x; A(x))/Dof(x, P(x)) 


and, secondly, that 4’ is continuous on (—«, +a). | 


A simple example will serve to emphasize the local character of 
the solution of the equation considered in theorem 7.41. If fis given 
- fle, y) = x-+sin y, 
then in the rectangle [—1, 1] x [—47, 47] there is a unique solution 
passing through the point (0, 0)”. There is no solution through any 
point (x, y)” with |x| > 1; and, if # > 47, solutions in 


f=, te f, A] 


are not unique, ie. there are values of x in [—1, 1] to which there 
correspond two or more values of y in [—/, f] such that f(x, y) = 0. 

The problem of theorem 7.41 has an obvious generalization in 
keeping with the spirit of the present chapter. In the equation 
f(x, y) = 0 we replace the real numbers x, y by points in R” and R” 
respectively. The function f will then have its domain in R” x R” and 
it is easy to see that the range of f should be in R”. For, writing 
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Mies 0G pS = Oe oa a f= bh. af eee 
have the system of equations 


c+ 5k 6s Oa Le ee 2s (7.41) 
Fas eeeg Xms Vis coeg Vn) se | 


which is to lead to unique expressions for ),, ...,y, in terms of 
X1, .++» X_; and our experience shows that the number of equations 
should, in fact, be equal to the number of unknowns. 

The equations (7.41) suggest that the domain of fis in R™™”. On 
the other hand the vector equation 


I(x, y) = 4 


indicates a domain in R” x R”, as was originally stipulated. This dual 
notation is legitimate in view of the natural identification of points 
in R™x R” and R™”. By equipping R” x R” with the norm induced 
by R™+” we make the two spaces entirely interchangeable; and we 
shall use either or both as convenient. For instance we denote by 


D; f(%,; soe9 Xs Vio vers Yn) 


the partial derivative of f,; with respect to the 7th co-ordinate in R™*” 
(an xifl<i<m,ay if m+1<i< m+n) and we also use the 
abbreviation D, f;(x, y). 

The indicated generalization of theorem 7.41 can be formulated 
and proved with the use only of partial derivatives, but it is more 
illuminating to use also a concept of partial differentiation which is 
akin to linear differentiation. | 


Definition. Let f be a function from R™ x R” to R? and suppose that 
(E, 9)" (€ € R™, 7 € R”) is an interior point of the domain E of f. If 
there is a linear function A: R™ — R? such that 


IFE+A, 1) -£E N—-AMD)|| = o(lAl) as Al +0, (7.42) 


then A is the linear partial derivative Da) /f(&, 7) of f at (§ 4)7. The 
linear partial derivative Di f(&, 4), a linear function on R” to R?, is 
similarly defined. | 

The argument of theorem 7.11 shows that linear partial derivatives 
are unique. 


Theorem 7.42, Suppose that the function f:E > R? (E < Rx R”) is 
differentiable at the point (&, 4)". Then Da f(& 4) and De f(é, 1) 


7.4] THE IMPLICIT FUNCTION THEOREM 215 


exist; moreover, for all x € R™ and all y € R", 


Dw f(E, 1) (x) = Df(s, 1); 4) 
and Dw f(E MY) = DFE, 1); Y). 
Proof. By the definition of Df(&, 7), 


IFE+A, 9 +k) —f(E, 2) — DFE, 1)(h, KI] = ofA, AI) 


as ||\(h, k)| +0. Hence, taking k = 0, we have (7.42) with A(A) 
replaced by Df(&, 7)(A, 0). The uniqueness of linear partial deriva- 
tives now gives the result for Dq f(& 7). | 


Corollary. Under the conditions of the theorem 


[Dw f/E |, | Do SCE m)\| < | DFE, 2). 


If the function f = (f,, ...,f,)” from R™ x R” to R? possesses the 
linear partial derivative Dy) f(, 74), then the partial derivatives 


D; f(é 7) Gi = 1, ....m;7 = 1,...,p) exist and Dy /f(é 7) is repre- 
sented by the matrix 


Dy fil + 7) aie aries 1) 
Shee SS (7.43) 


This is proved in the same way as theorem 7.13. Similarly, 
when Dw f(é 74) exists, then the partial derivatives D, f,(&, 7) 
Gi =m+l,...,.m+n;7 = 1,..., p) exist and the matrix of De f(§, 7) 


S Dias fl& 1) + Drran fils 1) 
2 (7.44) 


pS weeny oe 9). Dcca tlio 7) 


Thus, if fis differentiable at (&, 7)7, the matrices of Diy f(&, 7) and of 


Dw f(é, 1) consist, respectively, of the first m and last n columns of 
the matrix of Df(, 7). _ 7 


When p = m, the matrix (7.43) of Da f(g, 7) 18 square and we 


denote its determinant by Ji, f(&, 7); when p = n, the determinant 
of (7.44) is denoted by J f(g, 7). 

We have now laid the groundwork for the principal theorem of 
this chapter. 
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Theorem 7.43. (Implicit function theorem.) Let (£, 7)7(§ € R™, 7 € R”) 
be an interior point of a set E in R™ x R” and suppose that the function 
f:E > R" satisfies the following conditions: 


(i) (8 1) = 9; 


(ii) f is continuously differentiable in an open set G containing 
(E, 7)" 

(ili) Ja) f(E, 7) + 9. 

Then there exist intervals 


M = [£,-«, 8 +0] x... x[Em—% Em +O, 
N= [9:—2, 41+ F1* ».- ln -Fs In FA 


and a continuous function 6:M +N such that y = $(x) is the only 
solution lying in Mx N of the equation f(x, y) = 9; moreover ¢@ is 
continuously differentiable in M° and | 

D¢(x) = —[De Sf, o))I*° Da Sx, P(X) (7.45) 
forxe M?°. 

Proof. The proof follows the lines laid down in the proofs of 
theorems 3.47 and 7.41, except for additional complications which 
arise from the absence of a mean value theorem. 

By (ii), the function J) f is continuous on G. Hence, by (iil), we 
may suppose that, for (x, y)” € G, Ji f(x, y) + 0, so that the linear 


function Di f(x, vy): R” > R” is bijective. Put Dw f(é, 7) = Q. 
Since the function Dfis continuous on G, there are intervals 


K = [€,—6, §,+6]x... x [En—9% Smt], 
N = [1-2 1+A) Pn Bs In FP] 
such that, for (x, y)7’ €KxN, 
| D&E, 1) — Df, y)||_ < 1/(2n4|| O|). 


Then, denoting by J the identity function on R” and using the 
existence and linearity of Q-1, we have 


|T- O-10 De f(x, y)| = ||Q*° (Daf 1) — Do f(x, »)}| 
< || |Deof( )- Dal, y)| 
< ||Q-*| || DA, 2) — Df, y)|| 
< 1/(2n?). (7.46) 
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By (i) and (ii) we can also find a positive number a < é such that, 
for xe M = [£,-4, §,+a]x...x[E,—-0, m+], 


Ife, DI < AITO") 
and therefore | OL, n)]l| < 47. (7.47) 


Denote, as usual, by B(M, N) the set of all (bounded) functions 
w:M -> N, and by C(M, N) the set consisting of all the continuous 
functions in B(M, N). By theorems 3.44 and 3.45, the two spaces, 
equipped with their customary metrics, are both complete. 

Let Q be the mapping on B(M, N) defined by Q() = x, where 


x(x) = (x) - OF ¥Q))] eM). 


Note that, if y is continuous on M, then so is x. 
We first show that Q maps B(M, N) into itself. Let y ¢ B(M, N). 
Then, for xe M, 


QD) -—7 = ¥X)- OAT, YO) -7 
= {(x)- OAL, Yo)} - 9 - OS Mi - OTs I. 
Now, for fixed x, the function 
1-0-0 f(x, JN >" 


(where f(x, .) is the function g:N > R” defined by g(y) = /(, y) 
for y € N) has at y the linear derivative 


I-Q*0 Da f(x, y); 


for, at any point, the linear derivative of a linear function is the 
function itself. Hence, by theorem 7.31 and by (7.46) and (7.47), 


|G) =a] < pal vO)—a +48 < B. 


We next use a similar argument to prove that Q is a contraction 
mapping. If ¥,, ~~, ¢ BWW, N) and xe M, 


QW) (x) — 24/2) @) 
= fi) - QA, iO) - YO) - OA “ayy 
and so, by theorem 7.31 and (7.46), 
QQ @)-Q2U2) |) < thy - v2) I- 
Thus PQVs), 22) < 20Va Y2)- 
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It now follows from theorem 3.46 that Q has a unique fixed point. 
In other words there is a unique function ¢:M-—WN such that 
0Q(¢) = ¢, 1e. f(x, d(x)) = 6 for xe M. (The uniqueness of ¢ 
implies, in particular, that d(€) = 7.) 

To prove that ¢ is continuously differentiable we proceed in stages. 
First we note that ¢ is continuous. For we have remarked that, if y 
is continuous, then so is Q(y). Hence, if Q* is the restriction of Q to 
C(M, N), then Q* is a contraction mapping on C(M, N). Therefore 
Q* has a fixed point and, since a fixed point of Q* is also a fixed 
point of Q, it follows that ¢ is the fixed point of Q*. Thus ¢ € C(M, N). 

The next step is to show that, for x € M°, 


|Pa+h)—¢@)| = O(|h]) as [Al +0. 


Put Dy f(x, (x)) = S. Since S is bijective and ¢ is continuous, it 
follows from theorem 7.32 that, when ||| is sufficiently small, 


f(x, (x +A))| = |S, +A) fe, ¢))| 
1 
2 zS=] | p(x +h) -—¢@)]. 


Also, since Df is continuous on Mx.N, for (u, v)" e Mx N, 
| Da flu, v)|| < |. Df, v)| < A, 
say. Hence, by theorem 7.31, 
IF, Px +A)| = [Far+h, ath) fe, 6a+h)|| < AlAl. 


Therefore, when ||/|| is small enough, 


13 = Sx+h)— (0) < fC, d+) < AlAll, 
or p(x +h) — (x) < BIAl. 


We can now prove that D¢(x) exists and is given by (7.45). Putting 
(x) = y we have 


T(x+h, y+k)—f(x, y)— Df(x, y)(h, k) = rh, k), 
\|7(A, k)| 
(A, k)| 


Now take k = 4(x+h)—¢(x) 80 that f(x, y) = flxth, y+) = 8. 
Then Df(x, y)(h, k) = —r(h,k&), 


where >0 as |(A, ‘| +0. 
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i.e., by theorem 7.42, 


Dy f(% YA+ Dal, yk) = —rh, k), 
and so 
[Da f(x, yl 7° Daf VY} A+Kk = -Daf% VIC *)). 
Thus 
A(x +h)—4(x) + (Da f(x, (x)]-1° Dw f(x, ox) } A) = RA), 


[RO — |S rh, +4) — $00) 
A || 


< ||S-| 7, de +A) — G))|| [Al + BlAl 
2 IA, dx +h)- 9) [Al 


_ clr, $a +h)— $0)! 

= TG, $+) —6C)I’ se 
say. Since |!¢(x+h)—¢(x)|| > 0 as ||A|| > 0, the right-hand side of 
(7.48) tends to 0 as ||A|| > 0. 

Finally we show that D¢ is continuous or, equivalently, that the 
partial derivatives of ¢,, ...,¢, are continuous. This follows from 
the continuity of ¢ and from the matrix equivalent of (7.45) which 
gives each D,¢,(x) as a rational expression in the partial derivatives 
of f at the point (x, d(x)”. | 


where 


Notes. 
(i) The formula (7.45) agrees with the result obtained on differ- 
entiating the equation 


F(x, (x) = 4 (xe M°) 


when ¢ is known to be differentiable. In the case m = n = 1 the 
chain rule immediately leads to the identity 


D, f(x, $(x)) + Daf, $(x)) ox) = 0 
(cf. theorem 7.41). In general define the function g: 4° ~ R™x R” by 
g(x) = (x, A(x)”. 
Then (see exercise 7(a), 7) Dg(x) is given by 
Dg(x)(A) = (A, DO(x) A)’. 
Hence, putting g(x) = z we have, by theorems 7.22 and 7.42, 
0 = D(fog)(x)(A) = [Df(2)o De(~)]) = Df) [Da(*) 
= Df(a)[(h, Dd(x)(A))] = Dw» fODA+ Dof@[DE) MI. 
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Thus Dy f(x, (x))° DE(x) = —Daf(*, P(x). 


(ii) If, in the theorem, m = n, so that the matrices of Da f(x, y), 
Dw f(x, y) and D¢(x) all have n rows and n columns, then 


Jy f(x, P(X) 
Jd(x) = (- 1)" =... 
PO) = (I FF, $C) 
(For if A is ann xn matrix, then det (— A) = (— 1)" det A.) 
(iii) The condition Jj.) f(&, 7) + 0 is not necessary to ensure the 


existence, in the neighbourhood of the point (€, 7)", of a unique 
solution y = ¢(x) of the equation /(x, y) = 9. For instance the 


—— fx, y) = x®-y9 =0 (ye R) 
has the unique solution y = x, but J) f(0, 0) = 0. 


Illustration. Here, as so often in particular examples, we discard the suffix 
notation for vectors and use x, y, u, etc. as co-ordinates. 
The equations 


f(x, y, u,v, W) = x(u?+v*)+ yw®—5 = 0, 
f3(x, JY, U,V; w) srs x(u? + v*)+ yw? —9 eS 


are satisfied at the point (1, 1, 0, 1, 2)”. 
If p = (x, y)?, ¢ = (u,v, w)? and f= (1, f, fs)?, the equations (7.49) may 


be written 
f(D.Q@ = 4. 
Then 
x x y 


Jaf(p,Q =| 2xu  2xv 2Zyw | = 6x*y(v—w)(w—4) (u—v) 
3x" = 3x0" Gyw* 
and so, if p, = (1,17 «65 = ©, 52)", 
J(af(Po» 0) = 12. 
Therefore there is a solution g = (p), i.e. 
u= (x,y), v= (x,y) w= Pa(x, y) 


which passes through the point (po, qo)? or (1, 1, 0, 1, 2)”, and the solution is 
unique in the neighbourhood of this point. 
The matrices of Di f(p, g) and Dw f(p, g) are 


filx, y, u,v, W) = x(ut+v)+yw—3 = 0, 
(7.49) 


u+v Ww x x y 
w+o* wl, t2xu 2x0 2Zyw. 
ur+v > yw 3xu* 3xv? 3yw? 
respectively. Therefore, by (7.45), the matrix of — Ddé(p,) is 
Py See 5 Ga es ee i2 -—9 -2. 71 2 ; oe 
Oo 2 4 i bar 0 12 -4 ee age 8 16}. 
023° AZ 1.8 0-3 2/ \l 8 —-1 4 
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Thus D,¢,0, 1) = -—%, Do¢,(1, 1) = —3, 
D,$2(1, 1) = —3, D.$2(1, 1) = —3, 
D,¢3(1, 1) = ae D,¢,(1, 1) se —4. 


Exercises 7(d) 


1. Let f be a function with domain E < R”x R" and range in R” and let (é, 9)? 
be an interior point of E. 

(i) Show that both Djw»/f(é,7) and Def (é, 7) may exist without f being 
differentiable at (&, 7). 

(ii) Show that, if both linear partial derivatives of f exist in an open ball with 
centre (£, 7)7 and Dw f, Dwf are continuous at (¢, 7)”, then / is differentiable 


at (, 7)”. 
2. Prove that, if (€, 7)” satisfies the equation 
xy—logx+logy=0 (x,y > 9), 


then there is a continuously differentiable solution y = ¢(x) which passes 
through this point and is unique in its neighbourhood. Prove also that there is 
one point (£, 7)7 such that the corresponding solution y = ¢(x) has a maximum 
at &. 


3. The function f: R' > R' is continuously differentiable, f’ is strictly increasing 
and lim f’(x) = 0, lim f(x) = —00. Prove that, to any & + 0, there corres- 


> 00 =O 


ponds an ¥ such that fE+n) = fO +f) 


and that through such a point (&, 7)” there passes a solution y = ¢(x) of the 
equation 
f(xty) = fO)+f0) 


which is unique in the neighbourhood of (&, 7). 
Construct an example to show that, when £ = 0, there may be no corres- 
ponding 7. 


4. Find a point (&, 7, &)7 in the neighbourhood of which the equation 
sin yz+sin zx+sin xy = 0 
has a unique solution z = (x, y). 
5. Show that in the neighbourhood of any point which satisfies the equations 
x4+(x+z)y?-—3 = 0, 
x*+(2x+3z)y?-—6 = 0 
there is a unique solution y = ¢,(x), z = ¢,(x) of these equations. 


6. Show that the equations 
xy+yu'+z = 1, 


xy+yutzv = 1 


have a unique solution u = ¢,(x, y, z), v = $2 (x, y, Zz) in the neighbourhood of 
eee (x, », 2, u,v)” = 0,1, 1, 1, 0)? 
and find the matrix of Dd(0, 1, 1) (where ¢ = (4,, $)7). 
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7. Verify that there are points which satisfy the equations 

x*-+u+e° = 0, 

y?>+vt+e” = 0, 

z2+w+et = 0, 


Prove that, if (x9, Yo, Zo, Uo, Vos Wo)? is such a point, then in its neighbourhood 
there exists a unique solution u = ¢,(x, y, z), v = $,(x, y, z), w = $3(x, y, Z). 
Express the matrix of Ddé(x,y,z) (where ¢ = (¢,, $2, ¢3)7) in terms of 
X,Y, Z, U,V, Wz 
8. Show that there are points which satisfy the equations 

x-—e“cosv = 0, 

v—e”sin x = 0. 


Prove that, if (Xo, ¥o, Uo, Vo)? is such a point, then in its neighbourhood there 
exists a unique solution (u, v)” = d(x, y). Show that 


IP(X, Y) = vx. 


9. The functions f, g: R'—> R' are continuously differentiable and f’,g’ are 
strictly positive. Suppose that the equations 


f(xu)+g(yv) = 0, 
fQv)+2(zw) = 0, 
f(zw)+ 2(xu) = 0 


are satisfied at the point (Xo, Yo, Zo, Uo, Vo, Wo)”, Where Xo, Yo, Zo + 0. Show that, 
in a neighbourhood of this point, there is a unique solution (u, v, w)7 = d(x, y, z) 


with Jacobian 
uvw 


JO(x, y, Zz) = xyz" 


7.5. The inverse function theorem 


Let f be a function with domain and range in R”. The question 
whether f has an inverse can be put in the form: given y = (jy, ..., y,)" 
in the range of f; does the vector equation 


I(x)-y = 9, 
i.e. the system PAM ors XH HED 


Sir X1, sory PM eet = 0, 


have a unique solution for the vector x = (x, ..., x,)" in terms of 
Y = (Vu ---s Yn)*? The problem of inversion is therefore little more 
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than part of the larger problem, solved in the previous section, of 
the existence of implicitly defined functions. 


Theorem 7.51. (Inverse function theorem.) Let & be an interior point 
of a set E in R® and suppose that the senor f:E > R" satisfies the 
following conditions: 

(i) fis continuously differentiable in an open = containing §; 


(ii) Jf(S) + 9. 

Then there exists an open set U containing & such that the restriction 
of f to U is injective; the inverse function ¢ is continuously differentiable 
on the open set V = f(U); and if ye V and y = f(x), where xe U, 


th 
a D¢(y) = LDA@OT. (7.51) 
Proof. Define the function g: E x R" > R” by 
g(x,y) =f(x)-y (ek, ye R”). 


Then g is continuously differentiable at a point (x, y)? as long as fis 
continuously differentiable at x. This follows from the continuity of 
the partial derivatives D,g; which are given by 


[D.f(x) if 1<i<n, 
a a cc <2. 


where 6,, = 1 whenr = sand 0 whenr + s. a aa if f() = 9, then 
g(é, 7) = 0 and g is continuously differentiable in an open ball with 
centre (&, 7)”. Also clearly Diyg(&, 7) = Df(é) and so 


JiyeE, 0) = Jf(é) + 9. 


We may therefore apply theorem 7.43 to g (with the roles of x and y 
interchanged). This means that there are intervals 


Dj g;(x, y= 


sae Eire E,+a]x 10 X (if a E+ al, 
= [1,-2.1 +F)X «Pn -Ps Int PI 


and a continuous function ¢:N > M such that x = Q(y) is the only 
solution lying in M x N of the equation f(x) = y. In other words, the 
restriction of f to (N) (< M) is injective and the inverse function 
¢ is continuous. 

Since ¢ has a continuous inverse, by theorem 3.23, d(N°) is open; 
and, since 7 € N°, £ = (7) € P(N°). Put U = A(N°), V = N°. Then 
V = f(U) and this set is open. 
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Finally, ¢ is continuously differentiable in N° and so in V and 


DO(y) = —[Dws(GQ), YI* ° Deg(GQ), Y). 
Thus, if d(v) = x and J is the identity function on R”, 


Do(y) = -[Df@l*o(-D = (Df. | 
(The formula 7.51 also follows from theorem 7.22 once the exist- 
ence and differentiability of ¢ have been established.) 
Corollary. Under the conditions of the theorem 
Jo(y) = 1/Jf). 
Illustration. The function f: R? > R? is given by 
f(x, y) = (xe, xe"). 


eY xe" 
e~¥ —xe-¥ 


= — 2X, 


We have Jf (x, y) = 


Hence theorem 7.51 guarantees the existence of a local inverse (x, y)? = (yu, v), 
i.e. a unique solution of the equations 


xe" = n, xe" = 8, (7.52) 


in the neighbourhood of any point (&, 7)? such that € + 0. The matrix of 
DOO e ev: Ket FO axe * oe 
bape Br mem 0.39 


ev —xe- 2x \e-¥ —eY 


In fact it is easy to find ¢ explicitly. The range of f is the set {(u, v)7 |uv > 0} 
and the equations (7.52) lead to 
= +(w)*, y = log (u/v)* 


when u,v + 0, i.e. x + 0. If € > 0, the local inverse is x = (uv)*, y = log (u/v)? 
and, if § < 0, itis x = —(uwv)?, y = log (u/v)*. When & = 0, there is no inverse 
for then u = v = 0 and f maps any point (0, y)” into (0, 0)7. Finally we note 
that the matrix of Dd¢(u, v) is 
( + 4y-*yt ant 
dy —4ty 


according as x = (uv)? or x = — (uv), and this is the same as (7.53). 


The one-dimensional form of the inverse function theorem is not 
new (see C1, 62 and 69) and may be extended to provide for a global 
instead of only a local inverse. For if a real function has a continuous, 
non-vanishing derivative in an interval, then the function is strictly 
monotonic and therefore injective. In higher dimensions there is no 
corresponding extension. If the Jacobian of a function f does not 
vanish on an open set, then f need not be injective on that set (as a 
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whole) even if the set is connected. For instance the function 
J: R? > R? given by 


We y= cosy, fix, y).= asin y 


is not injective, since all the points (x, y+ 2nm)7(n = 0, +1, +2,...) 
have the same image; but Jf(x, y) = e > 0 for all (x, y)” € R?. 
Nevertheless the next theorem shows that the non-vanishing of the 
Jacobian entails a property which had previously been known only 
for functions with continuous inverses. 


Theorem 7.52. Let E be a subset of R”. Suppose that the function 
f:E — R° is continuously differentiable on the open set G © E and that 
Jf(x) + 0 for x € G. Then f(G) is open. 


Proof. Take any point 7 € f(G) and let ¢ € G be such that f(S) = 7. 
Then é is an interior point of G and, by theorem 7.51, there is an 
open set U in G which contains & and is such that V = f(U) is 
open. Since 7 € V and V < /(G), it follows that 7 is an interior point 


of f(G). | 


Exercises 7 (e) 


1. The function f from R” to R” is differentiable at the point €. Prove that the 
condition Jf(€) + 0 is not necessary to secure the existence of an inverse in the 
neighbourhood of the point £; but that it is necessary for ensuring the differ- 
entiability of this inverse. 


2. Show that the function f: R? > R? given by 
f(x, y) = (cosx+cosy, sin x+sin y)™ 


has an inverse in the neighbourhood of all points (€, 7)7 such that €—y9 + n7 
(n = 0, +1, +2, ...) and that at all other points no local inverse exists. 


3. Prove that, if §, 7, € + 0, then the function f: R® > R® given by 
f(x, y, Z) = (yz, 2x, xy)* 


has an inverse ¢ in the neighbourhood of (&, 7, 7 and find the matrix of 
D¢(yz, zx, xy). 


4. Prove that the function f: R? > R? given by 


fe ea 
fon= (ars: HT) 


has an inverse in the neighbourhood of every point of R?. If ¢ is the local 
inverse at (x, y)7, find Jé(/(x, y)). 
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5. The function f = (/,, fi, fs)" : R? > R® is given by 

IG, Y, Z) = XY +z, 

SAX, ¥. 2) = 2 AXES, 

fa(X, ¥, Z) = XYZ; 


and &£, 7, © are (pairwise) unequal real numbers. Show that f has an inverse 
in the neighbourhood of the point (&, 7, ¢)7 and that 


JOS, y, 2) = —Iv-2E-x)@-y)]?. 
6. Show that the function f: R® > R” (n = 3) given by 
Lf Otay vivg Bn) = Gt Xe, Huta Rey is Oe OS Ge ee 
has an inverse ¢ in the neighbourhood of the point (1, 1, ..., 1)7 and prove that 
02, 2, 5 DEH. 


7. Let G be an open subset of R” and suppose that the function f: G > R” is 
continuously differentiable and that Jf(x) + 0 for all x in G. Show that the 
image /(F) of a closed subset F of G need not be closed. 


7.6. Functional dependence 


The last theorem was stated as a result on mappings. It acquires 
a different aspect if its conclusion is replaced by the following (weaker) 
statement: a function ¥ from R” to R! for which Y’(f(x)) = 0 when 
x € Gis such that Y(y) = 0 in an open set. In this form the theorem 
is related to the notion of functional dependence. 


Definition. The real-valued functions f,, ...,f,, with domain E in R™ 
are said to be functionally dependent in an open subset G of E if there 
is a function ¥ from R® to R which does not vanish identically in any 
open set and which is such that ¥(f,(x), ....f,(x)) = 0 for x eG. 

The term functional independence has the obvious meaning and 
theorem 7.52 now leads to the following result. 


Theorem 7.61. Let E be a subset of R”. If f = (fy, ....f,)7:£ > R" 
is continuously differentiable in the open set G < E and Jf(x) is not 
identically zero for x € G, then f,, ..., f, are functionally independent 
in G. 


Proof. Let § € G be such that Jf(é) + 0. Then J/(x) + 0 for x in 
an open subset H of G and f(H) is open. Hence, if ®(f/(x)) = 0 for 
all x € G, then ®(y) = 0 in the open set f(H). | 


The pair of functions f;, f, on R® given by 


A(x, y) = xcosy, fx(x, y) = x sin y 
is independent in any open subset of R? since every such set contains 
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points at which the Jacobian x does not vanish. A simple example of 
functional dependence is provided by 


gi(x, y) = cos(x+y), go(x, y) = sin (x+y). 
For the function ¥ on R? given by © 
WV(u, v) = w+v2—-1 


only vanishes on the circle u?+v? = 1 and Y(g,(x, y), g2(%, y)) = 0 
for all (x, y)? € R’. 

The property of linear dependence, familiar from algebra, is, of 
course, a particular case of functional dependence. Another special 
case arises from the next definition. 


Definition. Let E be a subset of R™ and let G be an open set contained 
in E. If the functions 


f=t(f, inf 2A. 8 = Cp veep Bp) E > RP? 
are such that e(x) = O(f(x%)) (xEG) 


for a suitable function ® from R" to R®, then g is said to be functionally 
dependent on f in G. 

It is easy to see that, if g is functionally dependent on f, then each 
set of functions f;, .... fn. 2; (J = 1, ..-, p) 18 functionally dependent. 
(See exercise 7(/), 7.) 


Theorem 7.62. Let (€, 7)" (& € R™, 9 € R”) be an interior point of a set 
E in R™x R® and suppose that the functions f = (fi, ...s fn)" +E > R” 
and g = (81 «+s Zp)? : E > R? satisfy the following conditions: 
(i) fand g are continuously differentiable in an open set G containing 

(&, 9)7; 

(ii) J f(E, 1) + 95 

(iii) the function h = (fi, .-->fins Sv +s Sp)" is such that, for 
(x, y)" € G, the rank of the matrix of Dh(x, y) is m. 

Then there is an open Set containing (&, 9)" in which g is functionally 
dependent on f. 


Proof. Define the function f*:E x R™ > R™ by 
f*(x,y,2) = f(x, y)—-z (x,y)? € Eze R”). (7.61) 


Then f* is clearly continuously differentiable in an open set con- 
taining (&,7, 6)", where ¢ = /f(& 7). Also f*(€,7,¢) =@ and 
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Ja f*(& 9, 5) = Jay f(é, 7) + 0. Therefore, by theorem 7.43, there 
are closed intervals 
M= [és Ei, +a] Kee XK an cael 
N= [m1 —P, m+] Kae X Ps +7, 
P=[6,-2 &t+Alx...x[om—Bs Sm +A] 
and a continuous function ¢:N x P > M such that x = ¢(y, z) 1s the 
only solution lying in M x Nx P of the equation f*(x, y, z) = 0. The 
intervals M,N are taken to be so small that MxN¢G and 
Jin F(X) Y) + OinMxN. 
For? 21, 2a, 7. 
SE(GO, 2), ¥,2) =9 (2) € NXP). 
Since ¢ has components 4, ..., Om, the chain rule gives 
Difi'(G(Y, 2)s Vs Z) Dj Gly, 2) +--+ Din fi (PO, 2), V2 2) Dj Pm(Y> 2) 
~ Dinsgl LPO Z), ys Z) = 0 
forj = 1, ...,n. Writing (y, z) = x and using (7.61) we now have 
D, f(x, y) D390, z)+ ees + DiS VY) Dj On; Z) 
= —Dmig fy). (7.62) 
Now let g*:N x P > R® be the function defined by 
g*(y, z) = a(G,2),¥) (0, 2)" € NXP). 
Considering kth components we obtain from the chain rule 


Dy g(x, ¥) D3 P10 z)+...4¢ Dg, VY) Ds Om; Zz) 
— D; gi, 2) — — Dans 5 ZiAX y) (7.63) 


for j = 1, ...,”, where again x = (), Z). 
Let j and k be fixed. From the linear equations (7.62) correspond- 
ing toi = 1, ..., m and the equation (7.63) we obtain 


| D, f(x. y)--- Dig F% V) 0 | 


* 
oD? Dey ee ee 
Dy gx, y)-.-Dn 8x, Y) —1| 
Di Aix, P).5 BD, File y) ee See Ee y) 


= Di SulXs Dead gd y) meee eS y) : 
| D, glx, V).. Dy Biss y) Dict GMs y) 
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The determinant on the left is equal to —Jq f(x, y) + 0, and the 
determinant on the right is zero, by condition (ii). Hence | 


D, gi, Zz) sate 0 
for (y,z)? €NxP. This holds for j = 1,...,” and: ke =-1, ...3, mM. 


Therefore, by theorem 7.33, given any z€P, g*(y, z) is constant for 
yeN. Let ©:P > R? be the function defined by 


O(z) = g*(y,z) (VEN, zEP). 
There is a positive number r < min (2, /) such that 


If, y)-FE MI < F 


if ||x—£]] < rand |y—y|| < r. Denote the open balls BE; r)(< R”) 
and B(y;r)(< R”) by U, V respectively. 

Take any point (x, y)™ « Ux Vand put z = f(x, y). We then have 
xeM,yeN,zeP and f*(x,y,z)= 9%. But, given any point 
(v, w)" ENxP, u = ¢(v, w) is the only point in M satisfying 


f*(u, v, w) = 8. 


It follows that x = 4(y, z). Therefore the function ® is such that, 
for (x, y)? € Ux V, 


2(x, y) = 2(Py, 43 y) =e 2*(y, Z) = O(z) sis O(f(x, y)). 


Incidentally, since ¢ is continuously differentiable on N° x P°, ® is 
continuously differentiable on P°. | 


The theorem is again Jocal. Even if condition (ii) is strengthened to 
Jf (x, y) being non-zero at all points of the open set G (which may 
be connected), there need not be a function ® such that g(x) = ®(/(*)) 
throughout G. Take, for instance, 


Ai, X95 y) = ut COS Xo; F(X) X25 y) oe ea sin X95 
2(X1, Xe, Y) = Em7TYX>. 
Then Ji f(x, y) = e+24 > 0 in R? and the matrix of Dh(x, y) has 
constant rank 2, but clearly different functions ® have to be used in 
the strips 2k7 < x, < (2k+2)m (k = 0, + ee ee Bs 
The analogue of theorem 7.62 in which n = 0 (Le. y Is absent) 
takes the obvious form. The proof only requires an application of the 


inverse function theorem. (See exercise 7(f),9.) Changing our 
notation slightly we now combine the two cases > 0 and = 0. 
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Theorem 7.63. Let & be an interior point of a set E in R™ and suppose 
that the function f = (fi, ....f,)t:E > R” satisfies the following 
conditions: 

(i) fis continuously differentiable in an open set G containing &; 
(ii) for x € G, the rank of the matrix of Df(x) is r, whereO0 < r <n. 
Then there is an open set containing & in which n—r of the functions 

Fis «+9 Sn are functionally dependent on the remaining r functions. 

(When r = 0 all the functions f,, ..., 4, are constant in all con- 

nected open subsets of G.) 


Exercises 7(f) 
1. Show that the functions f, g, h on R* to R' given by 


S(x,y, z) = xsinycosz, g(x,y,z)=xsinysinz, h(x, y,z) = xcosy 
are functionally independent in every open subset of R?. 
2. Show that the functions f, g on R? to R'! given by 
Fx, y) = xyt+e*, g(x, y) = xy—e™ 
are functionally dependent in R®. 
3. Prove that there is an interval [—«, a]? x[—/, £]° in R® in which the equations 
x+y+2= u’ +p, 

x?+y24 72 = y3+yp, 

x§+yi+73 = wi+y 
have a unique solution u = ¢,(x, y, z), v = $,(x, y, Z), w = O3(x, y, z). Prove 


also that $1, 2, ¢; are functionally independent in (— a, «)3. 


4. Prove that the functions fi, ...,f, on E(< R”) to R! are functionally de- 
pendent in the open subset G of FE if and only if f(G) (where f = (f, ..., f,)7) has 
no interior point. 


5. Given a real-valued function F on a metric space (X,,/), denote the set 
{x € X|F(x) + 0} by A. Then the support of F is defined as the set A (i.e. as the 
smallest closed set outside which F(x) = 0). 

Let f= (f, ...,fn)™ be a function with domain E in R” and range in R*. 
Show that f,, ..., f, are functionally dependent in an open subset G of E if and 
only if there exists a function ‘Y from R” to R! which is such that its support 
contains f(G) and V(f(x)) = 0 for xEG. 


6. The real-valued functions f, ...,f, with domain E < R” are functionally 
dependent in an open subset G of E. Show that, if g is any real-valued function 
on E, then fi, ..., f,, g are functionally dependent in G. 


7. Let E be a subset of R” and let G be an open subset of E. Show that if, in 
G,g: E> R' is functionally dependent on f= (f,, ..., f,)7: E> R", then the 
set of functions f,, ..., f,, g is functionally dependent in G, 
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8. The (continuously differentiable) real functions fi, f, are defined by 


x* sin (1/x) (x + 9), 
0 (x =O). 


Show (by use of 4) that f;, fe are functionally dependent in R’, but that neither of 
fi fn is functionally dependent on the other in any open set containing the origin. 


Chee fe | 


9. The set Gin R” is open and the function f: G > R® is injective. Prove that any 
function g: G — R? is functionally dependent on fin G. 


10. The functions f, g, h on R? to R' are given by 
fy) = 40?—-y), a(x, y) = xy, Ax, y) = (+ y*) cos x. 
Show that, in R?, A is functionally dependent on f, g. 


11. The functions f, g, h on R‘ to R' are defined by 
f(w, x, ¥,Z) = wtxt+ytz, ew, x,y, Zz) = wtxP+y?tz%, 
h(w, x, y, z) = w2+x3+ y3+ 23—3(xyz+ yzw+zwx+t wxy). 
Show that, in R*, A is functionally dependent on f, g. 


7.7. Maxima and minima 


A real valued function fon a set XY in R™ is said to have a (local) 
maximum at the interior point § of X if there is a 6 > 0 such that 
f(x) < f(é) whenever x € B(E; 6); a minimum of fis similarly defined. 
The corresponding definitions in C1 for functions on R’ ($4.7) and 
on R? (§8.8) differ slightly in requiring strict inequalities for x in 
B(E; 6)—{&}. We have here adopted the more comprehensive defini- 
tion because in this chapter it will be convenient to be sure that a 
function has a maximum (minimum) at a point where it assumes 
its supremum (infimum). 

It is easy to see that, when / is differentiable at £, a necessary 
condition for fto have a maximum or minimum at & is that 


Df(é) = 9, 


the zero linear function. (See exercise 7(g), 1.) There are also 
sufficient conditions for a maximum or minimum; the cases m = 1, 2 
are treated in C1 (§4.7, 8.8); for the general case see exercise 7(h), 6. 
However our present concern is to extend the theory in a different 
direction: we shall consider maxima and minima of functions relative 
to the values taken on subsets of their domains. The most interesting 
case is that in which the subsets are determined by equations of the 
form g(x) = 8. 
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Definition. Let X be a subset of R™ and let g be a function on X with 
values in R%, where 1 < q < m. Denote the set {x|g(x) = 9} by Xo 
and suppose that the point € in X,y is an interior point of X. The function 
f:X — R' is now said to have, at £, a maximum [minimum] subject 
to the constraint 2(x) = 0 if there is a 6 > 0 such that f(x) < f(&) 
[f(x) = f(§)] whenever x € Xn BE; 9). 


The next theorem gives a necessary condition for a constrained 
maximum or minimum to occur at a given point. 


Theorem 7.71. Let & be an interior point of a set X in R™(m > 2) and 
_ suppose that the function g = (g, ..., Z,)7: X > R%, where <q < m, 
satisfies the following conditions: 


(i) g(&) = 9; 
(ii) g is continuously differentiable in an open set G containing £; 
(iii) the matrix of Dg(€) has (maximum) rank q. 
If the function f: X > R} is also differentiable in G and if f, subject 
to the constraint g(x) = 0, has a maximum or a minimum at the point 
£, then there are real numbers A,, ..., A, such that 


D, f(§) +4, D; gi($) +... +A, D; £(8) = 0 (7.71) 
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The numbers A,, ..., A, are called Lagrange’s multipliers. 


Proof. We may suppose that 


Dy ori 81(8)- . Din 2i($) 
Teer er ee ee ee Se + 0. (7.72) 


Dy coi Boks) ** Dy GX5) 


Now put p = m-—gq and regard f, g as functions with domains in 
R? x R4. Also write § = (9, €)7 (ve R®, €€ RY. 
By the implicit function theorem there are an open interval 


M° = (9, —&, 94 +%) x... X (Mp — 2, Ny +.) 


and a continuously differentiable function @ = (4, ..., @,)7: M° > R? 
such that ¢(7) = ¢ and 


g(y, P(y)) = 9 for yeM?. 
Take any he R? and let Dd(y)(h) = k(e R®). By equation (7.45) 
then h 
we thes have. Fy oy, ate 
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and, by theorem 7.42, this is equivalent to 
De(y, 6)(h, k) = 4. (7.73) 
Define the function % = (Wy, .... Wn)™(—& &) > R™ = RP*4 by 
94+ th, (1 <i< p), 
b:o+th) (ptl1<i< pt). 
Then, by the chain rule, 


vio = | 


h; (| <7 & p), 
Do: (p+l1 <i< pr). 
Since D¢(y)(h) = k, the last line may be written 
Wyi0) =k; A </1 <q). 
Next, let vy = fo y, so that 
y(t) = f(y t+th, dyt+th) (-a<t< %). (7.74) 


Then y has a maximum or a minimum at 0 and so x'(0) = 0. Using 
the note after theorem 7.22 we therefore have 


0 = x0) = Df, om) Y'O) = PfG, NYO) 
= Difta, Dy t...+ Dp f( $) hp 
+ Dyisf(y, S)ki +--+ Dora FO Sq (7-75) 


We have shown that, given h € R®, there is k € R% such that (7.75) 
and (7.73) hold. Putting 


Dfa,9 = 4 Digs, ©) = by 


fori = 1,....m = p+qand/ = 1,...,q we express these equations as 


¥i(0) = 


Ayhy +... +Gp Myp+dga, yt... +Qpig Kk = 0; 
by hy t... + Biphy t+ By pirkit--- + Dy pigke — 0, 


Bolt, + -2- + Dgp Mp + Dg pak t «0g. nigkg = 9: 
Moreover (7.72) now takes the form 


Dy ps1-+P1, p44 


= ee on). (7.76) 


Drggeye De | 
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The g+1 vectors 


(Qy41 eres Meu) (54, p41 eeeg De gual > eeeg (De oct e0eg De ccar 
(each with gq components) are linearly dependent, i.e. there are 
numbers Ao, A, ..., A,, not all zero, such that 

Al Gy uts sega yd HAQDY g1ts\-pPipag) Fs: 

+A (Dy saa og Oy og) =. 
But, by (7.76), 


(Dy, o+15 sees eee eeeg ©, ost eeeg ee al 
are linearly independent and so we may take A, = 1. Thus 


Aap + AzDy, poi ts $Ag bg ni = 0 (7.77) 
for? = 1, ...,@. 
Now, for a fixed i between 1 and p, take h; = 1 and h, = 0 for 
1<r<p,r +i. There are ky, ...,k, such that 


A; +Aniiky + eee + An+¢ - — 0, 
by + By patkit eee + Dy gigKe = 0, 


bes + Be osakych id PB hy =O. 


Multiply these equations by 1, A;,...,A, respectively and add. In 
view of (7.77) we have 


a; Ss A163; + ot Ag boi =D (7.78) 


and this holds for i = 1, ..., p. The two sets of equations (7.78) and 
(7.77) combined are equivalent to (7.71). | 


In a situation of the kind described in theorem 7.71, the multipliers 
A,,...,A, are of no particular interest and their role is mainly 
auxiliary. The q equations g,(£) = 0, ..., g,(&) = 0 and the m equa- 
tions (7.71) make up a set of m+q equations for the m+gq unknowns 


fic Get Si cae 


The points (, ..., £,,)7 obtained in this way are called critical points, 
and among them are any points of XY at which constrained maxima or 
minima occur. There remains the final investigation whether a given 
critical point actually yields a maximum or minimum. This is usually 
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best carried out on an ad hoc basis, for, although a general procedure 
exists, it is cumbersome. (References will be found in the notes at the 
end of the chapter.) However the related problem of finding the 
largest and least values of f (as distinct from Jocal maxima and 
minima) is often quite simple. For suppose that fis differentiable and 
g is continuously differentiable on the open set XY, that Dg(x) has 
maximum rank throughout X and that the set X, defined by the 
equation g(x) = 0 is compact. Then f assumes its bounds on Xo. 
Moreover, if § is a point of X, at which a bound is attained, then the 
corresponding function x of (7.74) has a maximum or minimum 
at 0. This means that & is a Critical point. Thus, to obtain the bounds 
of f on X, it is only necessary to compare the values of fat all critical 
points. 


Example. An ellipse in R? is given by the equations 
glx, Vs Z) = 2x? yh = 0, 
8X, )y,Z) = xtyt+z = 0. 


Find the points on the ellipse which are (a) nearest to and (5) furthest from 
the y-axis. 
We wish to find the points at which 


IG), 2) = + 2, 


subject to the constraint g(x, y, z) = 9, takes its largest and least values. 
The two determinants 


D,g,(x, y, Zz) Dsgi(x, y, Z) " 4x : are 
Dyg2(x,y,Z) Dsg2(x, y, Z) ee oe : 
Dgi(x, J, Z) Dsgi, y, Z) 3a a Z = 2y 

Dzg(x, Vs z) D3g(x, Vs Zz) 1 1 


cannot both vanish at a point satisfying g,(x, y, z) = 0. Hence theorem 7.71 may 
be used and, at a point giving a constrained maximum or minimum, the 


equations 2x+A,4x+A, = 0, 
Ay2y+ A, = 0, 
2z +A, = 0 


hold for suitable values of A, and A,. Therefore, at such a point, 


i ee 2 ae | 
O-2y -14-= 0, 
ae 6 


1.e. xy+z(2x—y) = 0. 
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Substituting for z from g2(x, y, z) = 0 we get 
27 —y* = 0. 
The equation g,(x, y, z) = 0 now gives x? = 1. The critical points are therefore 
p = (1, 42, -1-y2)7, ¢ = (1,—v2, -1+2)?, 
r= (—1,/2,1-—,2)7, s = (-1, —V2,14+ 72). 
Since f(p) = f(s) = 442,/2 
and f@ = fr) = 4-22, 


the points g, r are nearest to and the points p, s are furthest from the y-axis. 


Exercises 7(g) 
1. Show that, if the real-valued function fon the set X¥ < R” is differentiable at 
the (interior) point § of X and hasamaximum ora minimumaté, then Df(g) = ©. 


2. Prove the following generalization of Rolle’s theorem. Let G be a bounded, 
open set in R™ and let the function f: G— R' be continuous on G and dif- 
ferentiable in G. If f is constant on frG, then there exists a point € € G such 
that Df(f) = O. 


3. Find the largest and least values of xy subject to the constraint x?+xy+y? = 1. 


4. Find the largest and least values of (y—z)(z—x)(x—y) subject to the con- 

dition x?+ y?+z? = 2. 

5. Obtain the maxima and minima of 

x?+ y2+ 7?-—yz—zx—xy 

subject to the condition : 
x?+ y?4+ 27-2x+2y+6z+9 = 0. 

6. Calculate the dimensions of the rectangular box of largest volume which can 

be made from a sheet of area a’, if the box is (i) to have a top, (ii) not to have 

a top. 


7. Find the largest and least values of x*+y%+z* subject to the conditions 
x?+y?+z77 = 1landxt+yt+z=1. 


8. Obtain the largest and least values of 
Ax+yt+z)—xyz 


on the closed ball {(x, y, z)7|x?+y?+z? < 9}. 
(Consider 2(x + y+ z)— xyz (i) without restriction, and (ii) under the constraint 
x?e+y?+z% = 9.) 


9. Find the points of the set in R* defined by 
x2 Ae ee 


which are, respectively, nearest to and furthest from the origin. 
(Consider x?+ y?+z? under the constraints (i) x+z = 1, Gi) x+z = 1 and 
4x?+y? = 16.) 
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7.8. Second and higher derivatives 


The purpose of this section is to give a short account of linear 
derivatives of order higher than the first. The notion of partial 
derivative of arbitrary order is taken for granted (see C1, 154) and 
we now only fix the notation. If fis a function from R” to R* 
D;D,f(é) means D,(D,f(é)) and D; D;f(&) is abbreviated to Dj F(§). 
For partial derivatives of higher order the notation is analogous. 

Although in §7.1 linear derivatives were defined only for functions 

from R” to R", we remarked that precisely the same definition may 
be adopted for functions with domain and range in any normed 
vector spaces. Now, when f is a function from R™ to R”, Df is a 
function from R™ to the normed vector space L(R”, R”). We can 
therefore speak of the linear derivative of Dfand the definition below 
is the natural consequence. 
Definition. Let the function f:X > R", where X © R", be differenti- 
able on the subset X, of X; and let & be an interior point of X,. Then 
f is said to be twice differentiable at & if Df is differentiable at &, i.e. 
if there exists a linear function w: R™ > L(R™, R") such that 


LAE +A Die) — ath +0 as |\h| +0 


(where the norms in numerator and denominator are those in L(R", R”) 
and R™ respectively); the linear function y is called the second linear 
derivative of f at & and is denoted by D?f(§). 
It is easy to prove (as in theorem 7.11) that second linear deriva- 
tives are unique. 
When /is a function from R” to R” and is twice differentiable at &, 
D*f(é) e L(R™, L(R™, R”)), 
D*f(E)(u) € L(R™, R") (ue R”), 
D*f(E)(u)(v) € R” (u, vE R”). 
The function D2f(£) is not easy to visualize; but D®f(£)(.)(.) 1s 
simply a function from R™ x R” to R”. It is the function x defined by 
xu, ¥) = DFQWO) WveR), 
which turns out to be bilinear. By this we means that x(u, .) is a 
linear function (on R™”) for every ue R™ and x(.,v) is linear for 
every v € R”. In other words, 


x(cu, v) = x(u, cv) = cx(u, v) 
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and 
x(t+u, v) = x(t, v)+x(u, v), xX(U,vt+w) = x(u, v)+Xx(u, W) 
for every c € R! and all ¢, u, v, we R™. This is very easy to see. First 
xu, = D*f(2)@) 1s linear, as D*f() has values in L(R”, R”). 
Secondly, since D?/(€) is linear, 
D*f(E)(cu) = cD*f(E)(u) 
and D*f(E)E+u) = Df) (+ DE) (U), 
that is, for every v, 
D*f(E)(cu)(v) = cD*f(E)(u)(v) 
and DFf(EE+ue) = DAHNOW+ DASW). 


We shall show further that y(u, v) is symmetric in u, v, but the proof 
is rather more difficult. 


Theorem 7.81. If the function f:X — R", where X < R”, is twice 
differentiable at the point &, then the bilinear function D*f(€)(.)(.) is 


symmetric, i.e. D*f( £)(u)(v) = D?f(E)(v)(u) 
Sor all u, v € R™. 


Proof. Let ¢ be any positive number. There is an open ball B(é; 30) 
(in X) within which fis differentiable and such that 


| Df(E +7) — Df(E)- DEV) || < ella (7.81) 
whenever 9 € B(£; 36). Take any points h, k in B(é; 6) and put 


| A= f(E+h+k)—f(E+h)-f(E+%) +f(8). 
Now define the function g: B(@; 20) > R” by 
g(x) = f(E+h+x)-f(E+x) (|x| < 29), 


so that A = g(k)-—g(9). 
Then g is differentiable in B(@; 20) and 
Dg(x) = Df(€+h+x)— Df(E+x). (7.82) 


Also, by exercise 7(c), 3, if S is the closed ball with centre 6 and 
radius ||k||(< 9), 


|| A — Dg(0)(k)|| = |lg(A) —g(@) — Dg(#)(-)| 
< (sup || De(x) — Dg(6)||) |All. (7.83) 
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Let x € S. Then, by (7.81), 
|| DAE +h +x) — Df(S)— Df(E(A+x)|| < e({|Al + [l>l) 
and || DAE + x) — Df(E) — DF(E)@)I|_ < ella, 
so that, by (7.82), 
|| Dex) — D2f(E) || < (Al +2|[x1) < 2e(([A +] AI). (7.84) 
It follows from (7.84) and (7.83) that, first, 
| Dex) — Dg(®)|| < 4e((|A + ]|Al)) (eS) 
and then JA— D®f(E)(h)(A)|| < 6e([|Al| + AI) AI. 
Since A is symmetric in h and k we also have 


|A— DFE) < 6e(Al + Al) [Al 
and therefore 
| D2F(E) A) (kK) — D2F(E) (A) (A) < 6e([|Al| + ||AI))*. 
The last inequality was proved under the assumption that |Al, 
||K|| < 6. But, ifu, v are any points of R”, we may choose the positive 
number « so that ||au||, ||av|| < 6 and then 


| D26() (u) &~) — DFE) @) @)I| = | DFE) xu) (x) — D7F(E) (ev) (eu) | 
< 6e(|ou|| + |lov|])?/a2? = Ge(||wl| + [[v||)?. 
As this inequality holds for all e > 0, the left-hand side must be 0. | 


It is not difficult to show that any bilinear function 7 on R” x R™ 
to R” is of the form 


m m 
x(u, v) = x x Buvy Cu = (Uy, 0-5 Ug)? DV = (V4, -0+5 Um)” € RK), 
4= Ye 


where the a‘/ are constant vectors in R”. We shall show that, for 
D*f(£)(.)(-)s 
ai = DD; f(€) = (D:D; A(&), ---» Di Dj fnl€))”- 


The proof does not require the general result just mentioned; it 
follows from the next theorem which has independent interest. 


Theorem 7.82. Suppose that the function f = (fi; ....fn)7:X > R", 
where X < R™, is differentiable in the set X, < X and let v be any 
given point of R™. If f is twice differentiable at § (€ X,), the function 
Dy fur t+... + Din SiVin 7 
ene ase ee ree (7.85) 
D, fty +... + DaSntm 
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from X, to R” is differentiable at & and 

Dg(é) = D®f(€)(.) ), 
i.e. Dg(E)(u) = Df(E))() (we R”). 


Proof. Let 
Df(é+h)— Df(S)— DF(S)(A) = rh) 


so that r(h) « L(R”, R”) and |r(A)|| = o(|h||) as ||h|| > 0. Then 

DAE +h) (ve) - Df(S)M-DF(EV\A() = r(A)@) (7.86) 
and ||r(A)(v)|| < ||r(A)|| ||| = o(Al]) as |A|| > 0. But the left-hand side 
sl es e(E+h)—g(6)- DVM) 


and D?/(£)(.)(v) is a linear function on R™ to R”. Hence g is 
differentiable at € and 


Dg(é) = D*f(E)(.)@)- | 


Theorem 7.83. If the function f = (fi, ...,f,)?:X > R" where 
X < R", is twice differentiable at the point £, then 
(i) all the partial derivatives D; D; f,(§) Gj = 1, ....m;k =1,...,n) 
exist; 
Gi) D,D; f(s) = DD: f{S) Gj =1....m; k =1,..., 0); 
(ui) for all u = (Uy, ...5Um)*, 0 = (Vy, «+> Un)* € R*, 


EE DD AOur, 
D2f(E)(u)(v) = = z D;D;f(é)ujv; = Povrcrc recs cee eee 
= z D; D; f,{S) Uz; 

(7.87) 


Proof. In theorem 7.82 take v = e! (the vector with jth component 1 
and other components 0). Then the function 


g! a D,f = (D; fi, cory Di tay 


is differentiable at € and so, by. theorem 7.13, all partial derivatives 
of D;fi,.... Dj f, exist at §. This is true for all 7 and therefore (i) 
follows. 
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Now take an arbitrary v ¢ R™. The function g given by (7.85) is 
differentiable at and the matrix of Dg(&) 1s 


DID, fi) +--+ Din iE Wm] --- Dink Di (S01 + «++ + Din AlS)U ml 


DID, f,(8)vr +... + Din Fn(SWml- +» Dink Di fr(S)e1 + -- + Dn fdSeall 


Since each D;D,f,(&) exists, the formula D?f(£)(u)(v) = Dg(é)) 
now gives the expression (7.87). 
To prove (ii) we use (iii) and theorem 7.81. We have 


D:D; f(@) = DYE EV(e) = DfEE) (e) = Dj DSE). | 


It may be shown that the existence and continuity of all second 
order partial derivatives of a function ensure that the function is 
twice differentiable (cf. theorem 7.14). For this reason, part (ii) of 
theorem 7.83 invites comparison with theorem 7.83 of C1, where it 
is proved that D,D,f,({é) = D,;D;f,(é) if D:D,f, and D,;D,f;, are 
continuous at £. Two further results of the same type are embodied 
in exercise 7(h) 3, and in the theorem below which will be needed in 
chapter 10. 


Theorem 7.84. Let f=(fi,..»f,)" be a function with domain 
X < R™ and range in R"; and let & be an interior point of X. If D;f;, 
and D,f;, exist in an open set containing &, and are differentiable at &, 
then D;D;f,(&) = D; Di f,(§). 


Proof. It is clearly sufficient to take m = 2 and n = 1. We there- 
fore have to show (with a slight change of notation) that 


D,D2f(&, 9) = Dz2Dif(E; 1). 


Suppose that D,f, D,f exist in an open ball with centre (£, 7)” and 
radius 26. For 0 < |A| < 0, put 


Ath) = f(E+h, 9 +h) —-flE+h, 1)-F(E 9 +h) +f(E, 1)- 
We now define the differentiable real function g on (— 0, 6) by 
a(x) = f(E+x,9+h)—f(Et+x, 9) (|x| < 9). 
Then A(h) = g(h)—g(0) 


and, by the mean value theorem, there is an a such that 0 < « < | 


d 
- A(h) = hg’(eh) = hD f(E+ ah, 9 +h) — Di f(E +h, 1)}. 
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But, since D, fis differentiable at &, 


D,f(é+ ah, 9+h)— Di f(§, 7) 
= Dif(& noh+ Dz Di f(E )h + o(||(@A, h)]|) 
and D,f(é+ah,7)— D,f(&, 9) = Dif(§ 1) xh + o(|\(#A, 9)))). 
Hence, as h > 0, 
A(h)/h? = Dz D,f(é, 1) + oo? + 1) +), 
i.e. A(h)|h? > Dz Di f(5, 1). 
By symmetry, we also have 


A(h)/h? > D,D,f(é, 4) as h+O. | 


Linear derivatives of any order are defined inductively and give 
rise to multilinear functions. For instance D°f(&) is the linear deriva- 
tive at € of the function D?f whose domain is in R” and range in 
L(R”™, L(R™, R")); thus 


D?f(€) € L(R™, L(R”, L(R”, R"))). 


The function D?f(é)(.)(.)(.) is much less complex. It is simply a 
trilinear function from R™ x R™ x R™ to R” and is given by 


DY} Www) = SEE DiDj Def(Eusvy me 


Exercises 7(h) 
1. Let X be an open subset of R® such that tx ¢ X whenever xe X¥ and ¢ > 0. 
Let the function f: X > R! be twice differentiable and homogeneous of degree « 
(see exercise 7(5), 3). 


(i) Prove that D?f(x)(x)(x) = a(a—1) f(x) 


for all xe X. 
(ii) Prove that, if « = 1 and all second order partial derivatives of f are con- 


tinuous, then 
D,D,f(x) ... DnDif (x) 


DiDinf(X) ..» DnDnf) 


for all xe X. 
(The determinant is called the Hessian of f and is denoted by Af(x); it is 
equal to Jg(x), where g = (Dyf,..., Dnnf)*.) 
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2. Show that the function f: R? > R' given by 
xy(x* — y’) 
——__-- ((x, y) + (0, 0)), 
are Sar 9) * 0,0) 
0 (x, y) = (0, 0)) 
is differentiable everywhere and that all four second order partial derivatives 
exist everywhere, but that D,D,f(0, 0) + D,D,f(0, 0).: 


3. The function f from R? to R' is continuously differentiable in an open set G 
‘ containing the point (&, 7)”. Prove that, if D,D,f exists in G and is continuous at 
(€, 7)7, then D,D,.f(&, 7) exists and is equal to D,D,f(€, 7). 

(By considering 


Ath, k) = fE+h, 9+k)—-f(E, 9+ k)—-fE+h, 1) +fE, 1) 
show that, given h, k + 0, there exist a, # such that 0 < «, # < 1 and 


{Di f(E+ah, 9+k)— Dif(E+ah, yk = {DfE+h, 9+ Pk)— DefE, 4 +Pk)}/h; 
and prove that the left-hand side tends to D,D,f(€, 9) as (h, k)™ > (0, 0)”.) 


4. Three sets of conditions for the equality of mixed partial derivatives have been 
given in the present section. Construct a function which satisfies none of these 
sets of conditions but is nevertheless such that D,D.f(é, 7) and D.D,f(&, 7) 
exist and are equal. 


5. Taylor’s theorem in R™. Let X be a subset of R™ and £ an interior point of X. 
The function f: X > R' is such that all partial derivatives of order p are con- 
tinuous in an open ball B(é;r). Show that, when ||| < r, there is a number @ 
between 0 and 1 such that 
1 
fE+h) = fE)+D/OM+ Ti Pay +...+>-—_ ae (£)(h)? 


+5, DY E+ al) (hy, 


(p- 7 


where D?f(£)(h)? = D?f(&)(A) (A), etc. 
(D*f(é)(h)* may here be regarded as shorthand for 


oe Di see Dif Ohi “a9 hizs 
end es 
though, in fact, the continuity of the pth order partial derivatives implies the 
existence of the linear derivatives up to order p. 
Define the real function g by g(t) = f(€+ th) (|t| < r/|Al|) and use Taylor’s 
theorem in R! (Cl, 81); the case m = 2 is dealt with in Cl, 163.) 


6. Let € be an interior point of the set X in R”. Suppose that the function 
f: X > R' has continuous second order partial derivatives in an open set con- 
taining § and that D,f(§) = ... = D,f(€) = 0. Using the notation 


D,D,f() ... DeDy f(E) 


Hy f(E) = | crrcersececseeseceesseeeseees 


DD; f(§) ... DeDef(S) 
for k = 1, ..., m, show that 
(i) if H,f(€) > 0 for all k, then fhas a minimum at €; 


244 FUNCTIONS FROM R™ TO R" [7.8 


(ii) if H,f() < 0 when k is odd and H;, f(€) > 0 when & is even, then fhas a 
maximum at &; 

(iii) if either H,f(£) < 0 for an even k, or H,f(€) > 0 for an odd k and 
also H,,f() < 0 for an odd k, then fhas neither a maximum nor a minimum at &. 

(For the case m = 2 see Cl, theorem 8.8. 

The hypotheses in (i), (ii) and (iii) respectively, are equivalent to the following 
properties of D?f(&): 

Gy D{@j@yz > 0 for alla += @: 

(ii)’ D2f(é)\(u2) < 0 forallu + 6; 

(iii)’ D?f(é)(.)? takes values >0 and <O. 
This is proved in most books on linear algebra (e.g. L. Mirsky, Linear Algebra, 
400-—407).) 


7. Find the maxima and minima of the function f: R* — R' defined by 
IR 2 = SS ee 
What is the least value of fin the closed ball 
{(x, y, z)? |x? + 92+ 2? < a*}? 
8. Show that the function f: R? > R given by 
F(x, y) = x?y— 4x? —2y 


has no minima. Deduce that, if K is any compact subset of R?, then / attains its 
infimum on K at a point of the frontier of K. 


NOTES ON CHAPTER 7 


§7.1. In the context of the long history of the calculus a rigorous treatment of 
functions of several variables is surprisingly new. As late as 1909 the definition of 
differentiability of a function from R? to R' was the subject of a research paper 
(W. H. Young, ‘On differentials’, Proc. London Math. Soc. 7, 157-80). However 
subsequent progress was rapid. By 1925 Fréchet had produced a theory of differen- 
tiation on normed vector spaces; and in 1927 T. H. Hildebrandt and L. M. 
Graves published a parallel but independent account which included an implicit 
function theorem (‘Implicit functions and their differentials in general analysis’, 
Trans. Amer. Math. Soc. 29, 127-53). Recently, Dieudonné’s Foundations of 
Modern Analysis has been most influential in disseminating the modern theory of 
differentiation. 


§7.4. The limitations of the implicit function theorem 7.43 are vividly 
illustrated by the simple case of f(x, y) = (x—y)?. Although the equation 
f(x, vy) = 0 has the unique solution y = x, the conditions of 7.43 are nowhere 
satisfied since D,f(x, x) = 0 for all x. There exists a theory in which differen- 
tiability conditions are avoided, but only at the cost of considerable complica- 
tions. (W. Quade and S. Schottlaender, ‘Uber implizite Funktionen’, Math. 
Zeitschrift 89 (1965), 137-80.) 


§7.6. A converse of theorem 7.61 is true: if the function f= (f,, ...,f,)7 is 
continuously differentiable in an open set G and Jf(x) = 0 for xeG, then 
fi, ---5 fy are functionally dependent in every bounded open subset of G. Proofs of 
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this result are long and not easy. The first one is due to K. Knopp and R. 
Schmidt (‘Funktionaldeterminanten und Abhangigkeit von Funktionen’, 
Math. Zeitschrift 25 (1926), 373-87); a highly sophisticated account has recently 
been given by W. F. Newns (‘Functional dependence’, Amer. Math. Monthly 74 
(1967), 911-19). However if the condition Jf(x) = 0 for x € G is replaced by the 
more restrictive one that the matrix of Df(x) has constant rank r < n, then we 
have the situation of theorem 7.63 where there is, though only locally, the 
stronger conclusion that n—r of the functions f, ..., f, are functionally dependent 
on the remaining r functions. 

The fact that in theorem 7.63 the rank must not be variable is shown by the 
function f = (/1, fo)” : R? > R? defined by 


Aix, y) = x*, f(x,y) = E oS 


0 (x = 0). 


The rank of the matrix of Df(x, y) is 1 when x + 0 and is 0 when x = 0; and it 
follows from exercise 7(f), 8 that, in any open set containing a portion of the 
line x = 0, neither of the functions f,, f, is functionally dependent on the other. 


§7.7. For conditions sufficient to ensure the existence of a constrained maxi- 
mum or minimum at a critical point, see H. Hancock, Maxima and Minima, 
chapter VI. 


§ 7.8. Linear derivatives of arbitrary order are best studied in the general 
setting of normed vector spaces (as in Dieudonné’s book). For even if the function 
fhas values in R”, so that Df(£) has values in the same space, D?*f(&) and higher 
linear derivatives have their values in vector spaces of linear functions. 


8 
INTEGRALS IN R” 


8.1. Curves 


A function f on the closed interval [a, b] (and with range in R”) is 
said to be differentiable at a or b if the appropriate one-sided deriva- 
tives exist at these points (C1, 66); and f is called continuously 
differentiable if the derivative exists at all points of [a, b] and is con- 
tinuous on [a, 6]. (This supplements the definition of continuous 
differentiability for functions with open domains which was so 
extensively used in chapter 7.) The function f is called piecewise 
continuously differentiable on [a, b] if there is a dissection 


ays a <3. < a ea, SO 


such that the restriction of f to each interval [a,_,, a,;] is continuously 
differentiable. Thus, in particular, fis continuous on [a, b] and has 
one-sided derivatives at the points q,. 

We are now ready to discuss the notion of a curve in R” (n > 1) 
or Z, 


Definition. A transit in R” (or Z) is a continuous function whose 
domain is a finite closed interval in R1 and whose range lies in R” (or Z). 

We distinguish between the transit, which is a function, and the 
range of this function, which is a set of points in R” (or Z). Many 
different transits may have the same range. 

To obtain a curve we group together certain transits. Two transits 
@:[a, b] > R® (or Z), w:[c, d] > R” (or Z) are called equivalent, and 
we write ¢ ~ y¥, if 6 = woa, where «@ is a real function on [a, 5] 
with the following properties: 

(i) a(a) = c, a(b) = d; 
(ii) & is piecewise continuously differentiable; 

(ii) fora < t < b, w(t) and, where this does not exist, each one- 

sided derivative of «, is strictly positive. 
It is not difficult to see that the relation ~ so defined is, in fact, an 
equivalence relation (see exercise 8(a), 1). 


Definition. An equivalence class of transits is called a curve. Any 
member of the equivalence class constituting a curve y is called a 
representation of y. 
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All the representations of a curve y have the same range. This set 
of points is called the trace of y; it is compact and connected, being 
the image of a compact and connected set by a continuous function. 
If ¢:[a, b] > R” (or Z) is a representation of y, the points $(a), ¢(b) 
(which are the same for all representations of y) are called the 
initial point and the end point of y, respectively. 

Our definition of a curve is intended to convey the idea that a 
curve is a method of description of a certain set of points (its trace); 
also that such a set of points may be described in many different ways 
which, however, do not differ in essentials. The definition auto- 
matically excludes the possibility of a single curve being indicated 
by both sets of arrows in each of the illustrations below. 


A curve y’ is said to be an arc of a curve y if there exist a repre- 
sentation ¢ of y with domain [a, b] and a representation % of y’ with 
domain [c, d] such that [c, d] < [a, b] and > is the restriction of ¢ to 
[c, d]. 

If a transit is piecewise continuously differentiable, then every 
equivalent transit has the same property. We can therefore speak of 
piecewise continuously differentiable curves. They will be called 
paths. 

A rudimentary algebra of curves is useful. We begin with the 
operation of addition. Let 7,...,;Ym be curves such that, for 
i= 1,...,m—1, the end point of y,; coincides with the initial point 
of y;,,. For each i take a representation ¢* of y; whose domain is the 
interval [i—1,i]. Since ¢*(i) = ¢*+1(@), the function ¢ on [0, m] 
defined by 


g(t) = (ft) for i-l<t<i (@=1,...,m) 


is continuous and so generates a curve y. It is not difficult to see 
that y does not depend on the particular choice of functions ¢* (see 
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exercise 8(a), 3). We therefore define y to be the sum of j, ..., Ym 


and write 
Y = Vit... +Vmne 


Clearly, if y,, ..., Ym are paths, then y is a path. 
If ¢ on [a, b] is a representation of a curve y, then ¢* on [—b, —a] 


aes b() = 6(-) (-b<t< -a) 
generates a curve with the same trace as y, but with initial point ¢(d) 
and end point (a). It is natural to denote this curve by —y. Evidently 


—(-y) =y. A sum y,+(—yY2) is, as usual, written y,—y.. Note 


that 
=O $5 Fe) = Hn 3 Fe 


It is easy to formalize the intuitive idea of the length of a curve. 
To avoid repetition we confine ourselves to curves in R”; the treat- 
ment for Z is entirely analogous. 

Our definition of curve length is derived from the ancient device 
of inscribing polygons. Let ¢ on [a, b] be a transit in R”. For any 
dissection F of [a, b] given by 


C= ix 3, <...< 1; = 1, = 3, 


put AM, 6) = & I9@)- $a. 

We note that, by the triangle inequality for norms, the insertion of 
new points of subdivision increases a sum A(J, 4). 

Definition. If A(Z, %) is bounded for all dissections D of [a, b], the 
transit ® is said to be rectifiable and 


Aa?) = oe ee) 


is called the length of ¢. (For an alternative definition see exercise 
8 (a), 10.) 

Let ¢ on [a, 5], Y& on [c, d] be equivalent transits in R”. Then there 
is a continuous, strictly increasing function « on [a, b] such that 
a(a) = c, a(b) = dand 

A(t) = WiaD} (@<t< d). 
Since the inverse function a7? is also strictly increasing, the relations 
u=a(t) @<t<b), t=a-(u) (c<uK<d) 


set up a bijection between all the dissections of [a,b] and all the 
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dissections of [c, d]. Also, if 2, ’ are corresponding dissections of 
[a, b] and [c, d] respectively, then 

AQ, ¢) = AD", p). 


Hence ¢ and y are either both rectifiable or both not rectifiable and, 
if they are rectifiable, they have the same length. It follows that we are 
justified in applying the terms rectifiability and length to curves as 
well as to transits. We shall denote the length of a rectifiable curve 


y by A(y). 


Theorem 8.11. The curve in R” represented by ¢ on [a, b) is rectifiable 
if and only if each of the components $y, ...,¢n of ¢ is of bounded 
variation on [a, 5}. 


Proof. For y = (y1 -»-» Yn) in R” we have 
Ive < [yl < Dil+..+[ynl @= 1... 
Therefore, for any dissection J of [a, b] and i = 1,..., 7, 
V2, $:) < AG, ¢) < VF, o)t+...+ VG, on) (8.11) 


and these inequalities lead to the required result. | 


Theorem 8.12. Let Y, V1, Y2 be curves such that y = Y1+%2. Then (i) y 
is rectifiable if and only if y, and yz are rectifiable and (ii) when y is 


eee A(y) = Ay) + A()- 


The proof is very similar to that of theorem 6.42 and is left to the 
reader. Part (i) is also an immediate consequence of theorems 8.11 
and 6.42. 


Theorem 8.13. Every path in R” is rectifiable; and, if ¢ on [a, 6] is a 
representation of a pathy in R", then 


a) = | Is'l. (8.12) 


Proof. The function ¢ is piecewise continuously differentiable. 
Hence [a, b] may be divided into a finite number of subintervals in 
each of which the components ¢,, ..., 6, have bounded derivatives. 
Thus ¢,, ..., ¢,, are of bounded variation in [a, b] and so, by theorem 
8.11, y is rectifiable. We next note that the integral in (8.12) exists 
although the integrand may be undefined at a finite number of 
points; for, elsewhere in [a, 5], ¢’ is bounded and continuous. To 
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prove the identity (8.12) we shall suppose that ¢ is continuously 
differentiable throughout [a, b]. This is permissible by theorem 8.12 
and exercise 8(a), 4. 


Take any € > 0. Since ¢4, ..., 6, are uniformly continuous, there 
is a 0 > 0 such that, for 7 = 1,..., n, 


|p;(u) — $;(v)| < € whenever a < u,v < b and |u—v| < 6. (8.13) 
Also there is a dissection F of [a, b] such that 
A(, $) > Ag(d)—e = A(y)—-«. 


We add, if necessary, points of division to Z to form a dissection 
Q* for which “(B*) < 6. Then 


A(D*, 6) > AF, o) > A(y)—e. (8.14) 
Let the points of division of Z* be 
a= i, <4 <3, <4; < £, =. 
For each j (j = 1, ..., m), the interval (t;_,, t;) contains a point 7,, 
such tat $M) 6-9) = Bt) 4-0. 
Therefore, by (8.13), for every ¢ in [t,_,, t,], 
$4) -$(4D-POG-tD| < 4-t4) G=1,....7) 


| $C) - PG) -19'O| a-ti-D| < ne(t;--t,4). (8.15) 

But, by exercise 6(a), 8, there is a u; in [t,;_,, t,] such that 

[lel = leah e—1-0. 
Hence, by (8.15), 

= 
I9)-9ed1-f 16 
This holds for i = 1, ..., m. Adding these inequalities we obtain 

Aas, )-f Io" 
and therefore, by (8.14), 


sqft 


< ne(t; —_ t,_4). 


< ne(b—a) 


< [l+n(b—a)]e. | 
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Exercises 8 (a) 


1. Prove that the relation ~ between transits is an equivalence relation. 


2. A curve in R” is said to be smooth if one of its representations ¢ : [a, b] > R” 
is continuously differentiable and such that ¢’(t) + 9 in [a, b]. Show that not all 
representations of a smooth curve are continuously differentiable. (Contrast 
paths all of whose representations are piecewise continuously differentiable.) 


3. Show that, in the definition of the sum y,+...+/,, of the curves y; (i = 1, ..., m) 
in R", the choice of the representations ¢* : [i—1, i] > R” of y, is immaterial. 


4. Let ¢ on [a, b] be a representation of a curve y and let ue (a, b). Denote the 
arcs corresponding to the restrictions of ¢ to [a, u] and [u, b] respectively by 
Y, and y,. Prove that y = y1+72. 


5. A rectifiable curve y has initial point p and end point q. Prove that 
A(y) = ||p—al.- 


6. Curves 71, Ys, 73 in R? are given by the functions ¢* = (gi, 0) (i = 1, 2, 3), all 
on [—1, 1], where 


i(t) = t, 

i(t) = — sin p71, 

a(t) = | ? —— 
—tcos(zm/t) (0 < |t| < 1). 


Show that 71, 72, 73 have all the same trace and the same initial and end points, 
but that y,, 7, are rectifiable whereas y, is not. Show also that A(y,) + A(72). 


7. Let p, g be coprime positive integers. Find the length of the epicycloid 
represented by ¢ = (¢;, $2) : [0, 2g7] ~ R?, where 


$,(t) = (2+ i acos t+acos (2+ i t, - off) = (2+ y asin t+a sin (2+ 7 ft. 
(The curve is generated by the movement of a point fixed on a circle of radius a 
as this circle rolls on a fixed circle of radius (p/q)a.) 


8. Find the length of the curve in R* represented by ¢ on [0, 2], where 
A(t) = (tcost, tsint, t) (0 < t < 2). 


9. Let ¢ on [a, b] be a rectifiable transit in R”. Prove that the function s on 
[a, 6] given by 0 tor? =a 


ie 5 pe fora<t<b 


is continuous on [a, 5]. (Arc length is a continuous function of the ‘parameter’ 7.) 


10. Let ¢ on [a, b] be a transit in R”. Show that ¢ is rectifiable if and only if 
lim A(Q, ¢) exists and that, if the limit does exist, then it is equal to A®(¢). 

U(D)—0 

(Cf. theorem 6.71 and exercise 6(g), 3.) 
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11. A space filling curve. The continuous function f: R! > R1is defined as follows: 
f has period 2, is even and f(t) = 0 for rt é [0, 4], f(t) = 1 for te [4, 1], fis linear 
in [4, 4]. The functions ¢, % : [0, 1] ~ R! are now defined by 


X(t) = Ot GIT. tO MT a 


es I 1 3 1 2n—-—1 
W(t) = 5fG31)+ fC t+... +5FG t)+.... 
Show that ¢ and yw are continuous and that the curve represented by (¢, v) 
passes through every point of the square [0, 1] x [0, 1]. 
If €, 7 € [0, 1], let 2 : 
as, 7 € 10, 1] & = 0°13... 9 = 0° fos... 


in the binary scale (so that each «; or f/; is 0 or 1). Let 7 € [0, 1] be the number 
defined in the ternary scale by 


T = 0° (2,) (2/1) (22) (22)... 
and show that = (7), 7 = y(7).) 


8.2. Line integrals 


Let G be an open subset of R”(n > 1). We wish to define an 
integral of a continuous function f:G > R” along a rectifiable curve 
y in G. (We say that ‘a curve lies in a set’ when the trace of the curve 
is contained in that set.) 

Take a representation ¢:[a, b] > R” of y and a dissection D of 
he eee Q=e%) <4 <2. fey = = os. 

When t;_, < 7; < t;, f{P(7,)} is a value of f on the trace of the arc 
corresponding to [t,_,, t;] and we accordingly consider a sum 


z FKOT)} {P(G) -— Oj} = ; 2 FiP(T) PUG) — Pj} 
tt E HOME) PG} 8.21) 


Each function ¢,; is of bounded variation on [a, b] and, since f and ? 
are continuous, fo ¢ is continuous. Hence f,0¢ € R(¢;; a, b) for 
i = 1,...,n, and, by exercise 6(g), 1, 


ELIA) - $d} > | Kio Bab 
as (2) > 0. Thus the sum in (8.21) converges to 


3 [io db. 
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But we cannot define this expression to be the integral of f along y 
until we have shown that it is independent of the particular repre- 
sentation ¢ of y. 


Theorem 8.21. Let G be an open set in R” and suppose that the function 
f:G > R" is continuous. If y is a rectifiable curve in G and $:{a, b| > R", 
W:[c, d] > R® are both representations of y, then’ 


3 [ho vas. = 3 [ Vio wna 


Proof. Since ¢ ~ wy, there exists a continuous, strictly increasing 
function «:[a, b] > [c, d] such that a(a) = c,a(f) = dand¢ = you. 
By theorem 6.53, for each i, 


d b b 
['Gioman = [Keo oa dire) = | io Pad | 
We are now in a position to define the first of the two kinds of 
line integral. | 
Definition. Let G be an open set in R" and suppose that the function 
f:G > R" is continuous. If y is a rectifiable curve in G, then | ff, the 
= 


integral of f along y, is defined by 
n b 
[ r- 3 [ ie ob. (8.22) 


where ¢:[a, b] > R” is any representation of y. 

When y is a path the sum on the right in (8.22) reduces to a single 
Riemann integral. For then ¢ is piecewise continuously differentiable 
and each ¢, is the difference of two increasing functions which are 
piecewise continuously differentiable (see exercise 6(/), 13). There- 
fore, by theorem 6.21A, 


J Kio YOM = | : (fio PO Gi(tat 
= [r= 3 [ ie wsi= [ve 0.9" 


The form — | {AGO MO +... +H(GO) OO} Ae 
of | f (when y is a path) suggests the notation 
y 


[ fires - [ reo@eiwae 
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Therefore | f is often denoted by 
7 
| (f(x) dx, +... +f,,(x)dx,). 


Illustration. If y is the path in R? given by 


x=cost, y=sint (0 < t < 27) 
then 


| oryar—x2ay a [cost sin?¢—cos*t)dt = — [" costeat = —7, 
The path y is called the unit circle described in the positive (anti-clockwise) 
direction. A similar phrase may be used for the path given by 
x=a+trcost, y=b+rsint (0 < ¢t < 27). 
Another frequently encountered path (in R”) is one represented in the form 
x=pt+(q-p)t O<r< 1) 
This is what will be understood by the (straight line) segment from p to q. 


In the next theorem we collect the elementary properties of line 
integrals. 


Theorem 8.22. Let y, Y;, V2 be rectifiable curves in an open subset G of 
R® and let the functions f, g:G > R” be continuous. Then 


Ofaro-l[r+[s 
(ii) : tf = c : Ft (where c is a real number); 


Gi) | f= -| fs 


(iv) f= a + : Ff (when y, +72 is defined). 


LVitYa 


Proof. Parts (i) and (ii) are obvious and (iii) follows from the 
identity (see theorem 6.53) 


| nto—naec—9 = ["rtg@ 48.0 = -[ 460} 48.0. 
To prove (iv), let ¢ on [0, 1] and ¥ on [1, 2] be representations of 
y¥, and ¥, respectively. Then y,+/. is represented by y on [0, 2], 
—— ) - (9 for O<rs, 
OOS agy for” Vee $ 
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and fF hrowde = [io Ddde+ [io Wale | 


Integrals with respect to arc. The integrand is now real valued, not, 
as previously, vector valued. Given an open subset G of R”, a con- 
tinuous function f{:G > R! and a rectifiable curve y in G, we define 
an integral of f with respect to the arc length of y. 

Let ¢:[a, b] ~ R” be a representation of y and define the increasing 
function s,:[a, b] > = ' 


oa 0 fei = a, 
= =| ey fora < 1 <-d. 


For a dissection J of [a, b] into subintervals [to, t,], ..., [t,-1, t,] we 
consider the sum __, 

ZF {P(Ts)} Spey) — S(t;-w}> 
where t;_, < 7; < t;. Since fo ¢ is continuous and s, increases, this 


sum tends to ; 
[. (fo ¢)ds, (8.23) 


as u(D) + 0. 
Now, if %:[c,d]— R® is another representation of y, so that 
é = woa, where « satisfies the usual conditions, then 


AY, (¢) = AKOW) @<t < d), 
Le. Sg = Syou = SyO% 
Hence, as in the proof of theorem 8.21, 
d b = b 
[vo wds, = [fo oa))d(sy0.) = | (fo dds 


Thus the integrals (8.23) do not depend on the particular representa- 
tion ¢ of y and we can therefore denote them all by 


| 74 or | fooas 


a 
When y is a path, then, by theorem 8.13, s, is an indefinite integral 
of ||¢’|| and so, by theorem 6.21B, 


[ 7a - [ro #)ds, = [ ro #) I. 
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Line integrals with respect to arc have the properties (i), (ii) and 
(iv) stated in theorem 8.22. However (iii) is replaced by 


(iii | _fas = | fas. 


For if ¢:[a, b] > R” is a representation of y, let ¢*:[—b, —a] > R" 
be the representation of —~y defined by 


O*(t) = G(-1) (-b <t< —a). 
Then clearly Sge(t) = A(y) —sg(— 24) 


and 50" pH(D} dil) | “Fi(—}aaq)-s(-9) 


- |“ Ft(-9}45,(—9 


[ Pe@sasy0. 


In analysis, line integrals with respect to arc are comparatively 
unimportant. However, in mathematical physics, they have a part 
to play. (See exercises 8(5), 9 and 10.) 


Exercises §(5) 


1. Let G be an open subset of R" (n > 1); let y be a path in G and suppose that 
the functions /: G > R", g:G > R? are continuous. Prove, without using the 
Stieltjes integral, that, if ¢: [a,b] > R" and y:[c, d]— R® are both represen- 


tations of y, then > a 
[ von.8 =| row 


b d 
and [eoste'l = [eon 

a Cc 
2. Prove theorem 8.22 (iii) and the corresponding result for integrals with 
respect to arc, without making use of the Stieltjes integral, when y is a path. 


3. Let G be an open subset of R” and suppose that the function f: G > R® is 
continuous. Show that, if y is a rectifiable curve whose trace E lies in G, then 


J 


< [ [fds < MA), 
where M = sup ||f(x)|. 
xreH 
4. Let y be the triangular path in R* consisting of the segments from (0, 0) to 
(1, 0), from (1, 0) to (1, 1), and from (1, 1) to (0, 0). Evaluate 


(ae 
Ae oes oe 
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5. Given that a + 0, let y be the path in R® defined by 
x = a(i+cos2t), y=asin2t, z= 2acost (0<t< 2n). 


Show that the trace of y is the intersection of a (hollow) cylinder and a (hollow) 
sphere. Evaluate 


[ {(y? + 2?) dx+(z?+ x) dy + (x?4+ y”) dz}. 
Y 


6. The function f: R® > R? is defined by 
f(x, y, z) = (yz, 0, xy); 


and y is the (helical) path given by 
x=acost, y=bsint, z=ct (0<t < 4a). 


Evaluate [ f, and also [ f, where ¢ is the segment from the initial point of y to 
Y ee . 
the end point of y. 


7. Let G be an open set in R”. Let the function g: G — R' be continuously 
differentiable and let the function f: G > R” be defined by 

f= grad g = (Dig, ..., Dn). 
Prove that, if the path y with initial point u and end point v lies in G, then 


[ f = g(v)—glu). 
i 


(Thus | fhas the same value for all paths y in G which have the same initial 
e 3 
and end points; the integral is independent of path (contrast 6).) 


8. Let D be an open connected set (region) in R”. Prove that, if the func- 
tion f: D > R" is continuous and the line integral of f is independent of path, 
then there exists a continuously differentiable function g: D > R' such that 
f = grad g. 

(Thus, in view also of 7, in a region D the integral of a continuous function 
f: D > R” is independent of path if and only if f = grad g for some continuously 
differentiable function g: D > R'. By theorem 7.33, any two functions g differ by 
a constant. 


For the proof, take a fixed point a e€ Dand, for any x € D, define g(x) = [ ne 


Ya 
where y, is any path with initial point a and end point x; such paths exist, by 
theorem 3.32.) 


9. The ends of a uniform chain of length 2/ are held at the same height and a 
distance 2a(<2/) apart. It is known that, if the co-ordinate system is suitably 
chosen, the form of the chain is given by 


x=t, y=ccosh(t/c) (-a<t <a), 


where c is a constant. Show that c is uniquely determined by a and /. Show also 
that the centroid of the chain is a distance 


(1 (c2-+ 12) — ac) /(21) 


below the ends. 


(The centroid (,, ..., €,) of a curve y in R” is given by &;A(7) - i x; ds.) 
Y 
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10. A wire is shaped in the form of the twisted cubic given by 
Keesha pe 3, ners! oO ere 2) 
Show that the total length of the wire is 10, and that the arc length between the 
origin and the point (x, y, z) is ./2z. 
Find the moment of inertia of the wire about the y-axis, if its mass per unit 
length varies uniformly with arc length, from 0 at the origin to 1 at the other end. 
(If a wire is shaped as a curve y in R°, its moment of inertia about a straight 


line A is i up* ds, where “(x, y, z) is the mass per unit length at (x, y, z) and 


¥ 
p(x, y, z) is the distance of (x, y, z) from A.) 


8.3. Integration over intervals in R” 


We first dispose of some matters of terminology. By a closed 
interval in R™ we mean a set J of the form 


[a;, OF * 2 eid, OL 
where a; < b; fori = 1,..., m. The interval is cubical if 

bj a, = 2b, —G, 
The faces of I are the 2n sets 

Ef X45 05 Me = yy FOG. 6 NX, = eee. 
The volume V(1) of I is defined to be 
(b, —a,)...(b,, —an). 
If Z, is the dissection of the interval [a;, b;] given by 
G = xe HS PS x Se 


the dissection J = Y,x...xQ, of J is the division of J into the 
r = 7y...r,, subintervals 

Eo mere gre. es 
where 1 < j; < 7,....1 <j, < ry. TO define integration over J we 


now proceed in exactly the same way as in the one-dimensional case. 
Let the function f:J — R! be bounded. Given a dissection 


Gm Da ee, 
of J into subintervals fi, ..., J., put 
m, = inf f(x), M, = sup f(x) 
lor] = 1,...,F-and a 
(2) = (B,f) = Sm; Vil), S(D) = SB,f) => M,V(L). 
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These sums are, as usual, called Jower and upper sums respectively. 

Further subdivision of J clearly increases the lower sum and dimin- 

ishes the upper sum. It follows as in §6.1 that, if M and Y’ are any 

dissections of J, (D) < SZ). (8.31) 
We now write 


sup (F,f) =f f, int S,f) = | f 


and call the first expression the Jower integral and the second the 
upper integral of f over I. By (8.31), 


| fs { J (8.32) 


If there is equality in (8.32), fis said to be integrable over I and the 
common value of the upper and lower integrals, called the integral 
of f over I, is denoted by 


[, 7 OF t I(x) dx. 


An integral over an interval in R”, wheren > 1, may, for emphasis, 
be called a multiple integral. In R? and R® the corresponding terms 
are double integral and triple integral, respectively. These integrals 


are often written 
[[7 or {{ I(x, y)dxdy 
 § I 


and {| S 2 or [[[ fe y, z)dxdy dz. 


Multiple integrals and integrals in R1 have the same elementary 
properties. The proofs of the following statements are easy adapta- 
tions of their analogues in chapter 6. 

(i) If f, g are integrable over /, then so are |f|, 1/f provided that 
inf f(x) > 0, af+ bg (where a, 5 are real numbers) and fg; also 


[f < [ls 


and [ (af+bg) = af f+] 


(ii) Let J,, J, be intervals which have a face in common and are 
non-overlapping (i.e. have no common interior points) so that 
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f, UJ, is an interval (distinct from J, and from /,). Then, for any 
bounded function fon J, u 4, 


| I, sae i | Page and :. ; a a {s + [f (8.33) 


Thus, if fis integrable over J, and over J,, then f is integrable over 


I, U J, and 
Jaw Sift fF 


It also follows that integrability over an interval implies integrability 
over any subinterval. 

The multi-dimensional form of Darboux’s theorem (6.71) is 
important. For a dissection Y of the interval J into subintervals 
I,,...,1,, the mesh u(Z) of Z is the largest of the diameters 


pl), eee pU,) 
(see exercise 3(d), 7). If now fis any bounded function on J, 
[ f=jims@,/) and [ r= lim S(,), 
I 


(D)-+0 M(Z)—>0 


Now let f be a real-valued function on the interval K = 7xJ of 
Rv = R?, where 1 — R™ and Jc R”. Suppose that, for every 
y EJ, the function /(., y) is integrable over / and that the function 


@ on J given by 
69) =| fs ade 


is integrable over J. We then write 


eat 
ws [a] fen sdas 


and call this expression a repeated (or iterated) integral. The same 
decomposition of K leads to the integral 


[ax | sos na 
(if it exists). 
A repeated —— may involve any number of integrations. For 


instance if 
= [a,, b,] Xie X [ay bo], 
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one such integral is ; : 
| hed ee. Xp) AX}. 
ap a 
The next theorem and its corollaries serve to connect the integral 


of f over K and the associated repeated integrals; exercises 8(c), 6-9 
emphasize the distinctiveness of these integrals. 


Theorem 8.31. Let I,J be closed intervals in R” and R” respectively 
and suppose that the function f:IxJ — R°* is integrable over Ix J. 
Then the functions $, y on J defined by 


60) = { fos dx, v0) = | fos nde 
are integrable over J and 


[ | Ses ydx =| fos ydedy = it a | fos y)dx. (8.34) 


Proof. Let @ be any dissection of [x J. Then @ is of the form 
D,xD,, where D;,D,; are dissections of J and J into subintervals 
I, .... 1, and J,,....d, wespecuvely. “The subintervals of JxJ are 
therefore I, x J; (k = 1,...,.7r3;/ = 1,..., 8). 


Put M,, = sup f(x,y) N= ap Wy) 
Ed] 


(2, y)ELEX SI 


and, for any y EJ, let 
My) = sup f(x, y). 
If ye J,, we have 


EMM) > SMU) > | fos nde = YO) 
and, since this is true for all y €/;, 
Mai) = iy. 


Also VI, xJ;) = V(Ii,)V(J,) and therefore 
YS UMW Ex) 2 UNV) 2 | v. 
11=1 1=1 J 


ee 


This inequality holds for all dissections of 1x J. Hence 


aes “ {, = {a { fos y)dx. 


262 INTEGRALS IN R" [8.3 


An argument similar to the one above shows that 


L? <[_ ay | fos y)dx. 


To prove the integrability of 7% and the right-hand identity in 
(8.34) we also use the inequalities 


I, dy | es (x, y)dx < | : dy {s (x, y)dx < [ dy { f(x, y)dx, 


for these then give 


<[ar| feande< fd [ feeds ff 


Ix y 


In the last theorem the functions ¢, y may, of course, be replaced 
by ¢*, &*, where 


#@) =| Sand, PO =| ford ed. 


We are therefore led to a sufficient condition for the invertibility 
of the order of integration in a repeated integral. 


Corollary 1. Let I, J be closed intervals in R™ and R” respectively. If 
(i) the function f:I x J > R° is integrable, 


(ii) | T(x, y)dx exists for every y € J, 
I 
(iii) | I(x, y)dy exists for every x ET, 
7 
then | dy | I(x, y)dx and | dx | T(x, y)dy 
a te & : eee 2 


exist and are equal. 

It is easily seen that every continuous function is integrable (cf. 
theorem 6.12(i)): The result of corollary 1 therefore certainly holds 
if fis continuous on J x J. (The existence of the two repeated integrals 


is also a consequence of the fact, to be proved in§8.7, that | Fix; Jex 
 § 


and | f(.; y)dy are continuous on J and J respectively.) 
: 


Repeated application of corollary 1 shows that, for continuous 
functions, the process of multiple integration may be Be by a 
number of single integrations. 
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Corollary 2. Let I = [a,, bi|x...*[dn, Bn] be an interval in R”. If 
the function f:I > R' is continuous, then 


Dy by 
[ few Se ae | wee. | ee) ue, 
an MH 


Moreover the integral on the right may be replaced by 
Din bi, 
| dey, s. { flop ong Xp) Dig 
Ain, Qi, 


where (i,,..+, i,) is any permutation of (1, ..., 1). 
Another corollary is a relation between double integrals and line 
integrals. 


Corollary 3. (Green’s theorem for intervals in R*.) Let I be the 
interval [a, b] x [c, d] in R? and denote by @I the path determined by the 
four straight line segments from (a, c) to (b,c) to (b, d) to (@, d) to 
(a, c). If G is an open set containing I and the function 


f= (fr f)iG> RB 


is continuously differentiable, then 
[f-|[ @r-Df. 
ol I 


Proof. In the statement of the corollary we have used our con- 
vention regarding the paths determined by straight line segments 
(see p. 254). Thus 62 = ¥1+¥2—Ys—Ya Where V1, Ya. Ys, Ya May be 
represented by « 


ty=e (e564 Pb), x= by=t {eat 


X 


> 


< d) 
ty=d (a<t<b), x=ay=t (C<t<d), 


Xx 


respectively. We now have, by corollary 2 and exercise 6(h), 10, 


{I Piha = [. dy [. D, fx, y) dx 
-['Kona-[fana=| s-] # 


and similarly {f, De f f- , f. | 
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Exercises 8(c) 


1. The real-valued function fon [a, b] x [c,d](< R?) is increasing in each variable, 
1.€., for each y €[c, d], f(x, y) increases with x and, for each x € [a, bj, f(x, y) 
increases with y. Prove that f is integrable over [a, b| x [c, d] (cf. theorem 
6.12 (ii)). Extend this result to functions on intervals in R" (n = 1, 2, ...). 


2. Let I be a closed interval in R” and suppose that the function f: J > R! is 
integrable over J. Define the function ¢: 7x R" > R* by 


(x,y) =f) (xeL ye R’). 


Prove that ¢ is integrable over every interval K = IxJ in R™*", 
Suppose, in addition, that the function g: J > R! is integrable and that the 
function h: K > R! is defined by 


A(x, y) =fx)e(y) weLyed). 
Show that A is integrable over K and that 


Je*= ({/) (J): 


3. Prove that, if a,b > 0, c + 0 and J = [0, a] x [0, 5], then 


rXPt+y?+c%)? Cc x(a" + b? + c*)" 


4. If J is the interval [—1, 2] x [—1, 0] x [0, 2] of R?, show that 


2 
| (5) dxdydz = 9 log 6. 
r\l—|xly 


I ay | VJ(1—y cos?x)dx (0 < a < In), 
0 0 
6. The function f on J = [0, 1] x [0, 1] (< R?®) is defined by 


5. Evaluate 


if at least one of x, y is irrational, 
f(x,y) = = 


if y is rational and x = p/q, 


where p, g are coprime integers and g > 0. Prove that 
1 1 
[fos ndrdy and [ay [sex node 
I 0 0 


exist and are equal, but that 1 , 
I ie I (Gi Pidy 
0 0 

does not exist. 

Find a function h on J such that | h exists, but neither of the associated 

I 

repeated integrals exists. 
7. Define the function f on J = [0, 7] x [0, 1] by 
cos x_ if y is rational, 


f(x, y) = 


0 if y is irrational. 
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Show that 1 ™ 
: dy [ f(x, y) dx 


exists, but that : ; 
i eo [ ro { AO Hid 
I 0 0 


do not exist. 


8. Let (71, Po, D3, ...) be the sequence of primes (2, 3, 5, ...); let 


na(2 
Pr Px 


and let P = tL) P,. 
k=1 


Show that the set P is dense in [0, 1] x [0, 1], but that any line parallel to the axes 
contains no more than a finite number of points of P. 
Define the function f: J = [0, 1] x [0, 1] ~ R! by 


f(x, y) = 


m,n = 1, ss Pa=1} 


1 forxeP, 
0 forxel—P. 


1 1 1 1 
Show that I dx [ f(x, y)dy and { dy I I(x, y)dx 
0 0 0 0 
exist and are both 0, but that I Ff does not exist. 
I 


9. The function /: [0, 1] x [0, 1] > R! is defined by 


iif fO< x. <:y <4, 
fixe vie = —iix* 7w0< y < x < I, 
0 otherwise. 
Prove that 


[a | fos nde and [dx [fc vay 
0 0 0 0 


both exist, but are unequal. (By theorem 8.31, f cannot be integrable over 
[0, 1] x [0, 1]; in fact, fis unbounded near the axes.) 


8.4. Integration over arbitrary bounded sets in R” 


We often wish to integrate functions over sets which are not 
intervals. There is a simple device for doing this (which applies to R! 
as well as to multi-dimensional Euclidean spaces). 

Let E be a bounded subset of R” and suppose that the function 
Jf: E — R' is bounded. Define the function /,: R” > R' by 


IQ). £ xe#, 
Fux) = O-"if  xek. 
It then follows easily from (8.33) (see exercise 8(d), 1) that 


[fe and | fe 
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have the same values for all closed intervals J containing E. We call 
these two numbers the lower integral and the upper integral of f over 


E, denoting them by = 
{ f and | f 
JE E 


respectively. If the upper and lower integrals are equal, their common 
value is called the integral of f over E£; it is written 


Jf 


and fis said to be integrable over E. 

When f = 1, fy, is the indicator (or characteristic) function x; (first 
mentioned in §6.8). The inner content c(E) of E and the outer 
content c(£) are defined by 


c(E) = ° l, “y= f: 1. 


If c(E) = c(£), the common value is called the content of E and is 
denoted by c(£). (Cf. §6.8.) Clearly, the content of a closed interval 
is its volume. It is also easy to verify that the content of an open 


interval (a, b,) ere (a,> b,,) 


is (b;—a,)...(5, —a,), the content (or volume) of its closure (exercise 
8(d), 2). For other elementary results on integration and content see 
exercises 8(d), 3-7. 

The theorem below generalizes theorem 6.81. 


Theorem 8.41. 
(i) The bounded set E in R” has zero content if and only if, given 
é > O, there is a finite number of closed intervals (or a finite number of 
open intervals) which cover E and whose total content is less than e. 
(ii) The statement (i) is true when all intervals are restricted to be 
cubical. 


Proof. We prove (i) and (ii) at the same time. 
First suppose that c(Z) = 0. Let K be any cubical interval con- 


taining E. Since | Xe = 0, Darboux’s theorem shows that there is a 
K 


dissection Y of K into cubical subintervals such that S(F, yz) < ¢; 
and S(Y, vz) is the sum of the contents of the closed cubical sub- 
intervals K,, ...,.K, which contain points of E. 
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Each K; may be replaced by a larger open cubical interval so that 
the content of the open intervals is still less than e. 

Now suppose that, given e > 0, there is a finite number of open 
intervals J, covering E and of total content less than e. Let J be a 
closed interval containing all the J, and form a dissection Y of J into 
subintervals J,,..., J. by producing the faces of the J,. (The process 
for n = 2 is illustrated in the figure; the intervals J; are shaded.) 


i 


Go ose 


Since the J; in J— UJ; do not contain points of E and the total content 
of the rest is less than e, S(Z, vz) < ¢. This holds for all ¢ > 0 and 


so | XE bce 0. 


If we start with closed intervals, we may enlarge them slightly into 
open ones and then proceed as before. | 


We have previously noted that continuous functions are integrable. 
This result can be significantly improved. 


Theorem 8.42. If f is bounded on the closed interval I and the set of 
points of discontinuity of f has zero content, then f is integrable over I. 


The proof, in which the points of discontinuity are covered by open 
intervals of small content, is easily adapted from the proof of the 
analogous theorem 6.82. 

An informal interpretation of the next theorem is that the sets 
with content are those whose edges are not too blurred. 


Theorem 8.43. A bounded set in R” has content if and only if its 
frontier has zero content. 
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Proof. Let E be a bounded set in R” and denote its frontier by F. 
The function y, is continuous on the open set F’. Hence, by theorem 
8.42, if c(F) = 0, then yz is integrable over every interval containing 
E, 42: CGE) exists: 

Now suppose that F does not have zero content, so that 


[ 1=a>0. 
F 


If J is an interval containing F, then S(G, x,) > a for every dis- 
section JZ of I. Take J to be a cubical interval whose frontier lies in 
E' and take 9 to be a dissection of J into congruent cubical sub- 
intervals /,, ..., 7,. 

Let € ¢ F. If & is in J, and in no other subinterval, then 


sup ¥z(x)—inf v,(x) = 1. (8.41) 
rely xely 


If, however, § is on the frontier of two or more of the /,, then (8.41) 
need hold for no more than one of these. Since at most 2” of the J, 
contain any given point of F, (8.41) holds for at least a fraction 1/2” 
of the J, meeting F. Hence 

SQ, Xz) —- AF, Xz) 2 a[2” 


and so, by Darboux’s theorem, 
| xe-[ Xe 2 al2” > 0. | 
I Jt 


A combination of theorems 8.42 and 8.43 gives a particularly 
useful condition for integrability. 


Theorem 8.44. Let E be a subset of R” which has content. If the 
bounded function f:E — R' is continuous except on a set of zero 
content, then f is integrable over E. 


Proof. Let X be the subset of E on which fis discontinuous. Then 
fz is continuous on (£)’ and on E°—X. Hence, if F is the frontier 
of E, all discontinuities of f;, lie in the set X U F which has zero 
content. It follows from theorem 8.42 that f, is integrable over every 
interval containing E. | 


Theorem 8.45. The trace of any rectifiable curve (in R") has zero 
content. 


Proof. Let the rectifiable curve y have trace I and length A. If 
yr :[a, b] > R” is a representation of y, the function s on [a, 5] given 
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by s(t) = A4,(W) is continuous (exercise 8(a), 9). Hence there are 
siege fer <<< Oe a =D 
such that @)—-se) = Arr (j= 1,...,7). 


Now, for each j, let K; be the cubical interval with centre (u’) and 
sides of length 2A/r. Then, by exercise 8(a), 5; the K; cover I’ and 


their total content 1s (ay . (2A) 


2r\" 
Aes Bae = See. 


which may be made arbitrarily small by taking r sufficiently large. | 


The previous theorem is mainly useful in R?; in higher dimensional 
spaces the sets of points with which it deals are not ‘large’ enough. 

Let m,n be integers such that 1 < m <n. A surface in R” 1s 
defined to be a continuously differentiable function ¥:G —> R’, 
where G is an open subset of R™. Also, if H is any compact subset of 
G, we call the restriction of y to H a surface section and the set f(H) 
its trace. A continuously differentiable transit is a surface section for 
which m = 1 and H is a finite closed interval. (The notes at the end 
of this chapter include some remarks about surfaces.) 


Lemma. Let G be an open subset of R" and suppose that the function 
@:G — R® is continuously differentiable. If the set E has zero content 
and E < G, then the set ¢(E) also has zero content. 


Proof. By theorem 8.43, E has zero content. It is enough to prove 
that d(£) has zero content. Given e > 0, there are closed cubical 
intervals J,,..., J, which cover E and the total content of which is 
less than e. Since the distance between the compact set E and the 
closed set G’ is positive (exercise 3(f), 14), we may suppose that each 
I; is contained in G. Put E 

Ul, = H. 
j=l 

The function D¢:G —>L(R”, R”) is continuous and therefore 
bounded on the compact set H. Let 


sup || Dé(x)|| = M. 
Then, by theorem 7.31, 
|gX)—-¢)|| < M||x—-yI 


whenever x,y EJ; (j= 1,...,7). Hence, if the sides of J; are of 
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length s,, 6(/;) can be enclosed in a cubical interval whose sides are 
of length (,/n)s,. Thus ¢(£) can be covered by a finite number of 
cubical intervals whose total content is less than (M./n)"e. | 


Theorem 8.46. The trace of a surface section in R” (n > 1) has zero 
content. 3 


Proof. Let the trace in question be y(H)(< R”), where y is a 
continuously differentiable function on the open set G in R”, where 
1 < m < nand His acompact subset of G. The set 


Hi Uy, Met oe Ee 


in R” is clearly compact and has zero content. Also the function y* 
on the open subset G x R”-” of R” given by 


i538; SY EN 5) 


is, by consideration of its partial derivatives, seen to be continuously 
differentiable. Since Y(H) = w*(H*), it therefore follows from the 
lemma that Y(H) has zero content. | 


We now introduce a convenient phrase. Let E' be a subset of R” 
and suppose that every point of F either does or does not possess a 
certain property P. If P holds for all points of E except for those in a 
set of zero content, we shall say that P holds nearly everywhere in E 
(or, for nearly all points of £). The n-dimensional form of theorem 
6.83 may then be stated as follows. 


Theorem 8.47. Suppose that f and g are bounded on a bounded subset 
E of R" and that f(x) = g(x) nearly everywhere in E. Then either both 
f and g are integrable over E or neither is integrable; and, if f, g are 


E 


The proof is similar to that of theorem 6.83. The theorem shows, 
as was pointed out in similar circumstances in §6.8, that the process 
of integration over a‘given set_does not require the integrand to be 
defined everywhere on that set—nearly everywhere is enough. This 
remark leads to a useful corollary of theorem 8.31. 
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Theorem 8.48. Let I, J be closed intervals in R™ and R" respectively. 


If 
(i) the function f:I x J > R' is integrable, and 


(ii) d(y) = | F(x, y)dx exists for nearly all y € J, 
Lf 
then ¢@ is integrable over J and 


[| fend =| fe. yard. 


This theorem is the basis for evaluating multiple integrals over 
sets which need not be intervals. We illustrate the method with a 
simple example. 

Let the functions u, v:[a, b] > R! be such that u(x) < v(x) for 
a<x <b and let E be the set of points (x, y) in R? such that 
a<x <bandu(x) < y < v(x). If, now, f:E > R' is integrable and 


w(x) 
|" fos nay 
exists for nearly every x in [a, b], then 
b W(x) 
[fos nerdy = [dx |" fos »day. 


The identity follows from theorem 8.48 and the observation that, 
if [a, b] x [c, d] > E, then 


d (2) 
[ Foes vay = [°° fs va. 


In particular, take E to be the triangle with vertices (0, 0), (0, 1), 
(1, 1) and let f: E > R? be given by 


T(x, y) = 4(x?+2y +1). 


Then E has content, since its frontier is the trace of a rectifiable curve 
(theorems 8.43, 8.45). Also f is continuous on £. Therefore / is 


1 
integrable over E (theorem 8.44) and | f(x, y)dy exists for every 
x €[0, 1]. Thus 


4 1 1 4 
[, (x?+2y+1)? dxdy = I, dx | (x?+2y+1)? dy 


=|. 2 3} 4 pep 2. 
- tees: xi pepe a ee O38" 
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We also have 


4 fo 4 
: rey tl SNS fol, (x2 + 2y +1)? a 


but the evaluation of the repeated integral on the right is tedious. , 


Exercises 8(d) 4 
1. The set E in R® is bounded. Prove that, if the function f: E > R’ is bounded, 


then { fz and | fz have the same values for all closed intervals J containing E. 
rie I | 


2. Prove that c{(a,, bi) x... X (Gn, bn)} = (64—))...(bn— Gn). 
3. Show that, if E is a subset of R” such that c(£) = 0, then, for any bounded 
function f: E > R’, | f exists and is 0. 

E 
4. The function fis integrable over a set E in R”. (i) ‘Show that f need not be 
integrable over every subset of E, but that fis integrable over every subset with 


content. (ii) Show that, if f is integrable over the subset X of E, then fis also 
integrable over E— X. 


5. Let E,, E, be bounded sets in R” and suppose that the function f: E, U E, > R* 
is non-negative and integrable over E, and over E,. Use the method of exercise 
6(h), 5 to prove that fis integrable over E, U E, and over £,f £, and that 


Pp Spey PET 
E,vUE, E,nH, x, E, 


Prove also that fis integrable over E,— FE, and that 


L |? | in oe 


Deduce that both results hold without the restriction that fis non-negative. 


6. Let the function f be bounded on the (bounded) set E in R”; and suppose that 
there exists a sequence (X;,) of subsets of E such that fis integrable over each set 
E— X, and c(X;,) > 0 as k > o. Prove that fis integrable over E and that 


leccahior bf as k—>o, 


7. The mean value theorem. The set E in R” has content and the function 
f: E > R' is integrable over E. Prove that there is a number 4 between the upper 
and lower bounds of fon E such that 


| f= po: 
Rp. 


If E is also closed and connected and / is continuous, prove that there is a point 
€ in E such that w = f(&). 
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8. Let J be the closed interval 

[a,, by] KX... X [Qn, br] 
in R” and let the function f: J > R! be integrable. For x = (%, ..., Xn) € J, put 

Js = [a,, x1] KX. X% [ans Xale 

Prove that the function F: J > R' defined by 

F(x) =} f wed 

Sa 

is continuous. (F(x) is defined for all x € J, since each J, has content; if x; = a; 
for some i, then c(J,) = 0.) 


9. Prove that the content of an open or closed disc of radius r is mr®. (The 
content of an n-dimensional ball will be found in exercise 8(/), 12.) 


10. (i) Let J be a closed interval in R* and suppose that the function f: J > R’ 
is integrable over J. Show that the set 


S = {(%1, sory Xn Xa+v|%1, ece9 Xn) E I, Xn+1 = f(%1, eres x} 


in R"+1 has zero content. 
(ii) Given a set E in R” and a non-negative function f: E > R’, the set 


D = {(X1y 0005 Xny na) |r, «++» Xn) € EO < Xanga < fH, «++» Xn)} 


in R"+1 is called the ordinate set of f on E. Show that, if E has content and fis 
continuous (as well as non-negative), then D has content and 


aD) =| f 


(Show that xp is nearly everywhere continuous.) 


11. Let a, b > O and let the function f be given by 


fe») = (=EZ)" Gare Ray # -B) 


Find the contents of the ordinate sets (see 10) of f on (i) the closed interval 
[—a, a] x [0, b] and (i) the set 


{(x, y) |y = 0, x*/a*+y?/b? < 1}. 
12. Show that, if S is the square with vertices (1, 0), (0, 1), (—1,.0), (0, —1), then 


x—y+tl1 - 
[aww = log 3. 


1 1 
i dy [ e~ vz dx, 
0 y 


(Note that the integrand is not defined on the y-axis.) 


14. Evaluate { _aedxdydz 
E(xtyt+z4+1)®’ 


13. Evaluate 


\ 


where E = {(x, y,z)|x 2 0,y 2 0,z > 0,x+y+z < I}. 
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15. The real function f on [a, b] is integrable and, for n = 1, 2, ..., the functions 
f, on [a, b] are defined by 


io 


fa [ “= fod: (a < x-<8). 


Hi 
Prove that i) Sn = FnsilX), 
a 


so that f;, is the mth repeated integral of f, i.e. 


hey (e 6 nei i: * F(ty) dt. 


8.5. Differentiation and integration 


The simplicity of the space R! sometimes fails to reveal the ideas 
underlying familiar aspects of real function theory. We saw this in 
chapters 3 and 7; another instance occurs in this section where we 
wish to extend the two relations 


£[t=se9 and [7 =f0-S@ 


(which hold under suitable conditions) to multiple integration. To do 
so we have to introduce derivatives of set functions. 

A set function is simply a real-valued function whose domain is a 
class of sets. For instance, if F is a set in R”, @z is the class of subsets 
of E with content and f: E > R' is a function integrable over E, a set 
function o,:@z — R1 is given (see exercise 8(d) 4), by 


o(X) =| f (Kes), 


Definition. Let E be a subset of R™ and let o be a set function on © x. 
Then o is said to be differentiable at the interior point x of E and to 
have the derivative Do(x) (a real number) if, given ¢ > 0, there is a 
& > 0 such that 
olla) _ Do(x) 
c(,) 

for every cubical interval I, which contains x and has diameter p(I,) 
less than 6. 

Suppose that U is a subset of E at all points of which o is differ- 
entiable. Then o is said to be uniformly differentiable on U if, given 
e > 0, there is a 6 >'0 such that (8.51) holds whenever x € U and I, 
is a cubical interval containing x with diameter p(I,) < 6. 

It is easy to see that the derivative of a set function is unique. For 


<€ (8.51) 
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set functions in R! there is a simple connection with ordinary deriva- 
tives. (See exercise 8(e), 1.) The notion of uniform differentiability 
has not been used before. 


Theorem 8.51. Suppose that E is a subset of R® and that the function 
J: E > R' is integrable over E. If x is an interior point of E and f is 
continuous at x, then a; is differentiable at x and Do;(x) = f(x). 

The proof is very similar to that of theorem 7.62 of C1. (See also 
exercise 6(a), 10.) 


Theorem 8,52. Let G be an open subset of R” and suppose that the 
function f:G — R' is continuous. If H is a compact subset of G, then 
o, is uniformly differentiable on H (and Do,(x) = f(x) for x € H). 


Proof. The compact set H and the closed set G’ are disjoint and so 
p(H, G’) = 2« > 0 (exercise 3(f), 14). Let 


K = UB(x; 2). 
zeH 


Then K is a compact subset of G and so / is uniformly continuous 
on K. Thus, given e > 0, there is a d > 0 such that 


[fu -—f(v)| < € whenever u,veK and |u—o| < 6. 
Let xe H. If £, is any cubical interval containing x such that 
pU,) < min (a, 6), then J, < K and so, for ye L,, 


f(x)-¢ < f(y”) < f(x) +e. 
It follows that 


(F)-Jo) < | F< (I) +9) cll), 


o(I;) 
ee I,) =i (x) 


The last theorem has a partial converse which is particularly 
important. To state it we need one more definition. 


Definition. Let E be a subset of R". The set function o:@, —> R' is 
said to be additive if, whenever X, Y € @z and X (\ Y has zero content, 
o(XU Y) = o(X)+0(Y). 

When the function /: E > R? is integrable over E, then, by exercises 
8(d), 3 and 5, the set function o; is additive. 
_ Note that two closed intervals J, J are such that cZnJ) = 0 if and 


only if any common points of J and J are frontier points of both. 
Such intervals we have called non-overlapping. 


1.e. 


<e. | 
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Theorem 8.53. Let G be an open subset of R” and let o be a non- 
negative, additive set function on € q. Suppose that H is a compact sub- 
set of G and o is uniformly differentiable on H with derivative f. Then 
the function f: H — R' is continuous and, if E is a set with content such 


that E < H°, 
HE) & e f (8.52) 


Proof. 

(i) We first show that the relation (8.52) holds when £ is a closed 
cubical interval J in H°. For this we do not need the hypothesis that 
o is non-negative. 

Given e > OQ, there is a 6 > O such that 


o(1;) 
c( I,) mon (x) 


whenever x € Hand I, is a cubical interval containing x with p(J,) < 6. 
Let u,v € H be such that ||u—v|| < 6/(2,/n). At least one of the 2” 
cubical intervals with diameter 406 and one vertex at u contains v also. 
If 7 is such an interval, 

7), }oW@) 


Ff) < | f-F) +|19_p@)| < 26 


Therefore f is (uniformly) continuous on H and the set function 
o,:6 30 > R' exists. 
By theorem 8.52, o, is uniformly differentiable on J and 


Do(x) = f(x). 


Hence the set function T= 0-0; 


Sag 


is uniformly differentiable on J and D7(x) = 0. Thus, given e’ > 0, 
there is a 6” > 0 such that (1) 


c(Z) 


whenever / is a closed cubical interval contained in J and with 
diameter p(J) < 6’. Divide J into non-overlapping cubical intervals 
I,,..., [, each with diameter less than 0’. Since o,; is additive, so is 
T and we have 


In| = | Enc) 


=< E 


< é’ > cU;) = e'c(/). 
j=1 


But this inequality holds for all e’ > 0 and so 7(/) = 0, i.e. 
o(J) = 0;(J). 
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(ii) Now let E be any set with content and with closure E in H°. 
Since p(E, (H°)’) > 0, given ¢, there are sets U, V, both finite unions 
of non-overlapping cubical intervals, such that 


UcEC VC He (8.53) 
and c(V)—e€ < c(E) < c(U) +e, 
or, by exercises 8(d), 4 and 5, 
c(E—U)<e, c(V—-E) <e. 


If ~ = sup f(x), we then have 
reH 


jf - i “ [fo (8.54) 
= | yl | af tHe (8.55) 


As o is non-negative and additive, we have, by (8.53), 


a(U) < o(E) < o(V) 
and, in view of (i), { JS <8) < | Sf 
By (8.54) and (8.55) we now obtain 
| fove < o(E£) < [_frue 


and these inequalities hold for all positive e. | 


Exercises 8(e) 


1. Let (a, b) be an open interval in R! and suppose that the functions /: (a, b) > R? 
and o : @@,» > R' are connected by the relation 


Ff (%2) -—f(%1) = F(X, X2]) (a < x1 < X_ < BD). 


Prove that o is differentiable at the point x of (a, b) if and only if fis differentiable 
at x and that, when the two functions are differentiable, 


Do(x) = f’(x). 


2. Show that, if Eis a subset of R”, o is an additive set function on @z and _X is 
a subset of E which has zero content, then o(X) = 0. 
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8.6. Transformation of integrals 


Much of the theory of multiple integration is a straight-forward 
extension of the corresponding one-dimensional theory. The most 
conspicuous exception is the change of variable formula 


leo], 02 oe 


(in which J¢ is the Jacobian of ¢). Whereas the proofs of correspond- 
ing results in R' (theorem 6.92 and exercise 6(i), 1) are relatively 
simple, our task now is long and, in places, difficult. The reason for 
this difference is that a continuous function transforms intervals into 
intervals in R1, but not in higher dimensions. We begin by consider- 
ing the effect of a linear transformation on the content of a paral- 
lelepiped. 

A parallelepiped P in R” is determined by the n pairs of hyper- 
planes on which its faces lie. If these hyperplanes are B,, C,,..., 
B,, Cy, where B,, C; are given by 


) 


Ay Xt... $AinXn = (: (6; = G3) 3 =e 


then & = (,,..., &,) €P if and only if 
b; < Qj 6,+ Pee + An En SC; (i = LE er n). (8.61) 


Since the faces of P have zero content (by theorem 8.46), P has 
content. Also, using the inequalities (8.61), we see that, if X is an 
n—m dimensional subspace of the form 


Xy;, = constant,..., x,,, = constant, 


then points of X which are interior (exterior) points of P in R” are 
also interior (exterior) points of Pn X in X. The frontier of Pn X¥ 
(clearly a bounded subset of X) therefore lies on the sets 


Bn X, ts se ¢ = iF ivy AD 


which are hyperplanes in X. Hence Pn X has (n—m dimensional) 
content in X. It follows from theorem 8.31 that c(P) is given by 


In 1 
oF} = [xe =| de,...{ Vpl Xi, +, Ree (8.62) 
: Pn Pi 


where J = [p,, | x... X [Dns Un] is any interval containing P. More- 
over the order of the integrations in (8.62) is immaterial. 
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The inequalities (8.61) also show that, if A is a bijective linear 
function on R” to R”, then A(P) is again a parallelepiped. 


Lemma 1. If P is a parallelepiped in R” and A is a linear function 
oO A Viel, (8.63) 
where J is the Jacobian (i.e. the determinant) of A. 


Proof. If J = 0, then the rows of the matrix of A are linearly 
dependent. Thus A(R”) is contained in a hyperplane A of R” (given 
by an equation a,x,+...+a,x, = 0). Hence A(P) is a bounded 
subset of A and so has zero content. This case will not, actually, 
be used in the sequel. 

Now suppose that J + 0. We first write (8.63) in the form 


| xxe = WIf xe (8.64) 
K eg 


where J, K are any intervals which contain P and A(P) respectively. 
Put Q = A(P). Then, since J + 0 and therefore A is bijective, (8.64) 


may be written as 
3 No. = vif XQo A. (8.65) 


Next we note that A is the composition of elementary linear 
functions “4, a, 7; (i = 1,...,n—1), where 
ji ie ee Xo, .., X,) Ck +0), 
Od eae, Xe, Xo, 55 Xu), 
ee, X) = (4, «on Xpcgs Hy, +015 Xn) 
(It is well known that a non-singular matrix is the product of 
elementary matrices—see, for instance, L. Mirsky, Linear Algebra, 
chapter vi.) Since the Jacobian of the composition of two (linear) 
functions is the product of their Jacobians, it is sufficient to prove 
(8.65) for the cases A = w, a, 7;. The Jacobians of these functions are 
k, 1 and —1 respectively. 
(i) A = pw. Take 
I = [-a,a]x...x[-a,a], K = [-|kla, |k|a]x...x[-a, al], 
where a is so large that J > P and K > Q. Then, by theorem 6.92, 


kif XQ = aif it XQ(kx1, ses Xn) dx, 


a ka 1 
pe [| go me Xe(*1, ony Xn) x ax, ork iL Xe 
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(ii) A = &. Let 
I = [-a, a|x[-), b]x... x[—8, 5], 
K = [-a+b, a—b|x[-5, b]x... x [—5, 5], 
where I > Pand K > Q. Then 


b a 
[, XQon -| Soot Vol Xa + Xa, Xo: «005 Maly 
~ —a 
b at+Z, | 
= | 5 ee | Vol tis Xay>+is Kae 
—b —A+2Z, 


b a—b 
= { de,..., ACEH X99 09 ta) ax, = | XQ: 
—b —a+b -. 


since [—a@+X,a+x.] > [—a+b, a—b] when x, €[—5, 5]. 
(ii) A= 7, If J = K = [-a,a]x...x[-—a, a], where J > P, Q, 
then‘ 


Xy OT; 
7 Q 


= ce AX 541 ie dx,. Le Xol%1. «+5 Xtras Agee 


= sins oe AX j41.. fe NQ(Xts -++2 Nests Lis +205 Xn) OY 


wis [ XQ> 


since the order in the repeated integral may be changed. | 


In the next lemma we no longer confine ourselves to linear trans- 
formations. We denote by I(x; 7) the closed ‘cubical’ interval with 
centre x and sides of length 27. 


Lemma 2. Let G be an open subset of R” and suppose that the function 
@:G— R” is continuously differentiable, injective and such that 
Jo(x) + 0 for x EG. 
(i) If the set E has content and E < G, then the set ¢(E) also has 
content. 
(ii) If H is a compact subset of G, then, as y +0, 


(a) p{P(I(x; 9))} > 0 
and (b) TD) + 66 


uniformly for x € H. 
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Proof. 

(i) Since the set E is bounded (it has content), E is compact and so, 
by theore.n 3.64, ¢(£) is compact. Hence the subset 4(E£) of ¢(£) is 
bounded and, by theorem 8.43 and the lemma of §8.4, to prove that 
¢(E) has content it is sufficient to prove that fr (£) = (fr £). 

We first note that the condition J¢(x) + 0 ensures that the subset 
X of G is open if and only if ¢(X) is open. (See theorems 3.23 and 
7.52.) In particular, ¢(G) is open. 

Now let x € fr E. Put y = ¢(x) and let B,(< ¢(G)) be an open ball 
with centre y. Then ¢-1(B,) is an open set containing x and, since 
x is a frontier point of EZ, d-(B,) contains points of E and of G—E. 
The injectiveness of ¢ then ensures that B, contains points of ¢(£) 
and of ¢(G) - d(E) = ¢(G—E). Hence (x) = ye fr d(£) and so 
d(fr E) < fr f(E£). The proof that fr g(E) < ¢(fr E) is similar. 

(ii) (a) When I(x; 7) < G, 

ploUxs M)} < 2, sup |de)— Hu) (8.66) 
and, for ue I(x; 7), ||x—ul| < (/n)yn. Hence (see exercise 7(c), 4(i)) 
the right-hand side of (8.66) tends to 0 as y tends to 0, uniformly 
for x € H. 


(b) By the inverse function theorem 7.51, 4-1 is continuously 
differentiable on the set ¢(G) and 


De-"((x)) = [Dex]. 
Also, by theorem 3.64, 6(H) is compact and so || D¢-"(y)|| is bounded 


fi Hf); let 
pee sup || Dé-Xy)|| = k. 
yeo() 


If therefore Ddé(x) = A,, Dd-(P(x)) = Azi and 
JAz*@)| < Aull 


whenever x € H and ue R”. The idea of the proof is that, for small 7, 
the image of (x;7) under the transformation , = Az'0 @ 1s, 


approximately, J(7,(x); 7). 
Take e€ such that 0 < e < 1/(k,/n). By exercise 7(c), 4(ii), there 
is ad > O such that 


P(x +h)— G(x) -ALA)|| < ellAl 
for x € H and |h|| < 6. Then, for such x and A, 
Az {d(x +A) — P(x) -—A)}I| < Ke llAl, 
i.e. Yo(x +h) — v(x) —hl| < kellhl. (8.67) 
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Let x ¢ H and let 7 be such that 0 < ,/ny < 6. It was shown in 
the proof of (i) that 


fr C(x; 0)) = v,(fr I(x; 7). 


Now, if x+hefr (x; 7), then 7 < |Al| < ./ny. Hence, by (8.67), 
points on the frontier of (x; 7)) lie within a distance ke,/ny of the 
frontier of J(¥,(x); 7). It follows (see exercise 8(f), 1) that 


M = I(x); n—keJng) < yx; 0)) | Iha(x); 9 +keJng) = N. 


Since A,0 %, = ¢, 
A{M) <= ox; 9) < A). 


By (i), (U(x; 7)) has content. Then, since JA, = J¢(x), lemma 1 gives 
[JA(x)|(1 — ken)” (29) < {Px 9))} < |J6C)|C1 + keyn)" 27)", 


‘ cp T(x; ))} i 
ie. (1—ke,/n)"|Jd(x)| < ee < (l+keJ/n)"|Jd(x)]. | 


Theorem 8.61. Let G be an open set in R” and let E be a set with 
content such that E < G. Suppose also that the function ¢:G —> R” 
is continuously differentiable and injective and that J¢(x) + 0 nearly 
everywhere in G. If the function f:6(E) — R' is continuous, then the 


integrals 
Jet md [,Fosiza 


exist and are equal. 

In the classical notation, according to which the Jacobian Jé of 
the transformation x = ¢(f) (or x; = $4, .... i) @ = 1, a DD is 
denoted by Oy «oy Xp) 


oe ty 


the relation to be proved is written 
IG... XG a -| SIO ge 
o(Z) E 


Proof. 

(i) First suppose that the set O on which Jd(x) = 0 is empty. 

In view of Tietze’s extension theorem (5.52) we may suppose that 
f is defined and continuous not only on the compact set d(£), but 
on the whole of G. Moreover f may be expressed in the form 


tf =f'-f-, wheref+ = 4(|f|+f),f- = 4(|f| —f) and these functions 


Opies Xn)| 
a(t, = t,.) | tj. 
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are non-negative and continuous. We shall therefore assume from 
now on that fis also non-negative. 

If the set X has content and XY < G, then by lemma 2(i), 6(X) 
has content. For all such sets X we define (as we may by theorem 
8.44) the non-negative set function 7 by 


ago YF 


| $(X) 
o is also additive, for if X,, X¥, < G and c(X,nN X2) = 0, then, by 
the lemma of §8.4, c(A(X,) N O(X2)) = c((X%N X,)) = O. 
Let p(E, G’) = 36 and put 


B= J), KK = U Bx: 6). 
ze xeH 


Take x € H and let J, be a closed, cubical interval containing x and 
with p(I,) < 6so that I, < K°. Since ¢(/,) is connected and compact 
(theorems 3.34 and 3.64), there is a point y € ¢(/,) such that 


By lemma 2(ii) (a) and the uniform continuity of f on the compact 


set ¢(K), f(y) > f(¢(x)) as pU,) > 0, uniformly for x ¢ H. Now Je 
is also uniformly continuous on ¢(K) and therefore lemma 2(ii) (5) 


shows that 
c(P(1,)) 
C(I,) = |Jd(x)| 


as p(/,,) > 0, uniformly for x € H. It follows that 


(I) _ py (BCI) 
AL) IO, FEC) 90) 


uniformly for x ¢ H. Since E has content and E < 1°, theorem 8.53 
gives 
o(E) =| (fo#)\J4l. 


(ii) When Q is not empty (but has zero content), then, for each 
integer k > 1, there is a set S;,, the union of a finite number of closed 
intervals, such that S, > Q and c(S;,) < 1/k. Replace each interval 
of S;,, by a similar, concentric, but open interval of twice the content. 
If 7;, is the union of these intervals, then c(T;,) < 2/k. 

Now E—T, has content (8(d), 5), E— 7), is a subset of the open 
set G—S;, and 


P(E-T,) = G(E-(EnT,)) = $(E)- (EN T;). 
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It follows from (i) that the integrals 


| fand { (fos\vel 
$(£)—¢(En Tx) E-(EnTx) 


exist and are equal. Also fis bounded on ¢(E)(< ¢(£)) and J¢ is 
bounded on E(< £). Moreover c(E 1 T;,) >0 as k > o and, since 
|| Dé(x)|| is bounded for x € H, an argument similar to that used in 
the lemma of §8.4 shows that c(¢(En T;,)) +0. The theorem now 
follows from exercise 8(d), 6. | 


The change of variable technique for evaluating integrals in R' 
is familiar and the use of theorem 8.61 in higher dimensions is 


analogous. We start with an integral | fand wish to find a function 
D 


@ such that, when ¢(£) = D, | (fo ¢)|J¢| is more amenable to 
E 


computation than the original integral. Unfortunately as it stands 
the theorem is not quite strong enough to cope with certain situations 
that occur quite frequently. Consider the polar transformation ¢ in 
R? given by 

x = 97,0) = Ftos0, y= off, 0) =F ae (8.68) 
whose Jacobian Jd(r, 9) is r. When D is a (bounded) closed set not 
meeting the half line {(x, y)|x < 0, y = 0} (or some other half line 
starting at the origin), there is no difficulty. But, even when D is as 
simple a set as the closed unit disc (given by x?+y? < 1), theorem 
8.61 is inapplicable; for a corresponding set E, such as 


[0, 1] x [0, 27], 


cannot be enclosed in an open set G in which ¢ is injective. In this 
case a slightly stronger (but more cumbersome) version of theorem 
8.61 is required. 


Theorem 8.62. Let G* be an open set in R and let E, G be sets with 
content such that G is open and E < G < G*. Suppose also that the 
function $:G* — R” is continuously differentiable on G*, injective on 
G and with Jacobian J¢(x) + 0 nearly everywhere in G. If the function 
ft: @(E) > R' is continuous, then the integrals 


| f and | (fo )|J¢| 
$(F) E 


exist and are equal. 
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Proof. We use the same notation as in the proof of theorem 8.61. 
The case E < G and Q = @ corresponds to part (i) of the earlier 
proof. In part (ii) we now let S, cover the set Q U fr G and have 


E-T, = EnT;, < EnT; © GnT; = GT; < Gn S; = G—S,. 
The argument then proceeds as before, except that we use the 
boundedness of D¢ on G instead of on H. | 


Example. 


io) 
—p2 Se es | 


For u > Q, let = 
E(u) = | eran. 
0 


Then E(u) -{" ede | pi dy = | e—u* dx dy, 
0 0 s 


where SS is the square [0, u] x [0, u] in R®. Let C,, C, be the sectors 
ix, pee oe So 0, + < vj, 
{(x, y)|x 2 0, y > 0, x+y? < 2u*}, 


respectively, so that C; < S < C,. Since e-*"-” > 0, 


| e'-V dxdy < {. ev dxdy < | 2 ev —v* dx dy. 

We now evaluate the integrals over C, and C, by using the polar 
transformation ¢ in R? given by (8.68) and appealing to theorem 
8.62. The function ¢ is continuously differentiable in G* = R® and 
injective in G = (0, 2u) x (0, 27); also Jd(r, 9) = r + 0 in G. For 
a set E such that ¢(E) = C, we can take the interval [0, u] x [0, 277], 
since theorem 8.62 does not require ¢ to be injective on E. We now 
have 


i e@'-¥ dxdy = | erdrdé = ¥ do | rear = 4In(l—e). 
C, E 0 0 
Similarly | e-*-v* dxdy = 4n(1 —e7?"”’). | 
a, 
Hence 1n(1—e-“") < E*{u) < jn(1—-e-™) 
and so E%(u) >47 asu—> oo. 


The theorems of this section, when interpreted for integrals over 
one-dimensional intervals, are rather weaker than the corresponding 
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results of chapter 6; at first sight they also look different, for they 
lead to the formula | 
[f= |, ¢oole' (8.69) 
(1) I 


However, let J = [a, £] and put d(~) = a, d(f) = b. The condition 
that ¢ is continuously differentiable and injective entails that ¢’ has 
constant sign. Hence, when ¢’ > 0, (J) = [a, b] and (8.69) is just 
the relation (6.95) of p. 188; and, when ¢’ < 0, d(J) = [b, a] and 


(8.69) is ‘f f= | : (fo $)(-¢'), 


which again reduces to (6.95). The reason why |J¢| appears in 
theorems 8.61 and 8.62, but ¢’ (not |d’|) in theorem 6.92 is that 


b b a 
| f is ‘oriented’ (ic. | f=- | /) , whereas a multiple integral has 
a a b 


no orientation. 
An affine transformation is a function ¢: R” > R” which is of the 


= B(x) = c+ACD), 


where c € R” and the function A: R” > R” is linear. Theorem 8.61 
(with f = 1) shows that content is invariant under an affine trans- 
formation with determinant 1 or —1 (a rigid body motion). This 
means, for instance, that the content of any rectangular parallele- 
piped is equal to the product of the lengths of its sides. 

Finally we note that, taking again f = 1 in theorems 8.61 and 
8.62, we obtain two propositions on content which include various 
auxiliary results that had been needed (the lemma of §8.4 and 
lemmas 1 and 2(i) of this section). 


Exercises 8(f) 
1. Let 0 < a < b. Show that, if the bounded set E in R” is such that 
ucE and frEC¢ J(u; b)—I(u; a), 
then I(u;a)- EC J(u; bd). 
2. Prove that, in theorems 8.61 and 8.62, the function f need only be assumed to 
be bounded on ¢(£) and continuous nearly everywhere in ¢(£). 


3. Show that, in R?, the content of any triangle is equal to half the length of any 
one side multiplied by the corresponding altitude. 


4. Show that, if D is the half-disc {(x, y)|y > 0, x8+y? < 1}, 


(x+y)? 2a 
ress cc Sa Bees 
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5. The function f: R? ~ R! is given by f(x, y) = x?/p?+y?/q?. Find the content 
of the ordinate set (see exercise 8(d), 10) of f on the elliptical set 


{(x, y) |x?/a® + y?/b? < 1}. 


6. Use the transformation (x, y) = ¢(u,v) given by x =u, y=v-—uU to 
evaluate 


| arc tan (x+y) dx dy, 
D 


where D is the triangle {(x, y)|x > 0,y > 0,x+y < I}. 


7. Show that the equations u = x?/y, v = y?/x define a continuously different- 
iable, bijective transformation (x, y) = d(u, v) on (0,0) x (0,0) to (0, 0) x 
(0, 00). Hence find the content of the set 


{(x, y) |x > 0, y > 0, a®y < x® < by, p?x < y® < g?x} 
in R?, whereO0 <a<b,0<p<dgq. 


8. Use the transformation in R? given by x = rcos®0, y= rsin®@ to prove that 
the content of the set 


is 477/35. 


{(x, y, z)|xt+y? +z? < 


9. The curve in R? represented by 
x = a(1t+2cost+cos2t), y = a(2sin¢t+sin2r) (0 < ¢t < 27) 


is called a cardioid (p = q = 1, and change of origin, in exercise 8 (a), 7). Show 
that the trace of this cardioid is given by the equation 


r= 2a(1+cos#) (0 < @ < 27) 


in polar co-ordinates. 
Find the content and centroid of the set 


= {(rcos 0, rsin 8)|0 <r < 2a(1+cos 9), 0 < @ < 27} 


in R®. (The centroid (&, 7) of a set E with content c(£) > 0 is defined by the 
equations 


Ec(E) _ xdxdy, yc(E) = ydxdy.) 
E E 


10. The set of points (x, y, z) in R? such that 8az > x?+y* (a > 0) is denoted 
by S; and P is the plane with equation x+y+2z = 8a. Find the content of the 
(bounded) set Q cut off from S by P. 


11. A cylindrical hole of diameter a is drilled through a solid sphere of radius a. 
Find the content of the material removed if (i) the hole is symmetrically placed, 
(ii) the centre of the sphere lies on the edge of the hole. 

(In (ii) the set whose content is to be found may be taken to be 


{(x, y, z) |x?@+y?+2? < a®; x*+y" < ay}. 


Note that the polar equation of the circle x?+y? = ay in R? is r = asin@ 
(0<6@<7).) 
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12. (i) Prove that, for n 22, the n-dimensional polar transformation 
(X1, ...) Xn) = P(r, A, ..., Ona) given by 

Xj Peo sm Go, Y.. sine, Sin GC, .. 

X= rsint, sin @,..,..sin 04... sin 6... 
r cos G, sin G,.... SiN Gy, 


& 
ao 
Il 


Meay = 1606-0, SNe 
Xt, = FCS TLS 
has Jacobian 
JOG, Oy. cg Ogg) = (= 1" Sn 2 sia. a 
and is injective on every open interval 
(0, a) x (O, 277) x (0, 7) x ... x (0, 77). 


(ii) Show that, in R”, the content c,(a) of an open or closed ball of radius a is 


“am when a= 2m (n= 1, 2; ...), 
m! 


2m+1,,,1 47m 
Sarin ge! when «= 2+!" tn =%; 1,.2) 


Note that, for every a, c,(a) > 0 as n > ©. 
(Pork =:0; 12.04. 


Phas (2 1)? 


ee (2k)! 7 eee 
2k Od9 = $6519 30 oe 
[ sin?” Od [sin di Qk+D!? 


0 | 22k(KN)2” 
see, for instance, C1, 133.) 


~13. Let S be the portion of the closed unit ball in R® which lies in the positive 
octant (where x, y, z = 0). Evaluate 


yz 
.—— dxdydz. 
ie i +e ly dz 
14. Green’s theorem for a disc (cf. theorem 8.31, corollary 3). Let G be an open 
subset of R? which contains the closed unit disc B; and denote the positively 


described unit circle by 2B. Prove that, if the function f = (/,, 4): G > R? is 
continuously differentiable, then 


I = | Oh DA. 
OB B 


Fa ; 0 : 
D, f(r cos t, r sin t)r = se [f(r cos t, r sin t)r cos t]— 7 alr cos ¢, r sin f) sin f] 
r 


(Show that 


and derive a similar formula for D,f,(r cos ¢, r sin f)r.) 
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8.7. Functions defined by integrals 


Let I,J be closed intervals in R” and R” respectively. If the 
function f:1x J > R' is such that, for each y € J, f(., y) is integrable 
over J, we consider the function ¢ on J given by 


Py) = | J = y)dx. 


The question we pose is what conditions on f will ensure the pro- 
perties of integrability, continuity and differentiability for ¢. Actually 
we have already dealt with the first of these properties. Theorem 8.48 
shows that, if fis integrable over /x J, then ¢ is integrable over J and 


eric! 


(In this context ¢ need only be defined nearly everywhere on J.) 


Theorem 8.71. If the function f:IxJ— R* is continuous, then, for 
every yeJ, f(.,y) is integrable over I and the function ¢:J — R'* 


defined by 
60) = | fl yx 


is continuous on J. 

Proof. The integrability of /(., y) for each y is obvious. Next, 
since [x J is compact, f is uniformly continuous. Thus, given € > 0, 
there is a 6 > O such that 

If Y-fe'.y)| < € 
whenever 
(x,y), (x,y) EIxJ and ||x-x'|?+|y-y/? < & 
(where ||x—x’|| and ||y —y’|| denote norms in R” and R” respectively). 
Therefore, if y, y’ ¢J and ||y—y’| < 4, 


190) 8091 < f 1fGs =F Dax < ce(D. | 


In the next theorem, on the differentiation of ¢, we take J to be 
an interval [c, d] in R!. This is the most important case; for the 
general case see exercise 8(g), 3. | 3 
Theorem 8.72. (Differentiation under the integral sign.) Given the 
function f:Ix[c, d] > R' (I < R™), suppose that 


80) =| fle ax 6.71) 
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exists for some y in [c, d] and that D,,,f exists and is continuous on 
Ix [c, d]. Then $(y) exists for all y in [c, d], the function ¢ is differenti- 


able and 
$0) =| Dwafls dx (e<y<d. 8.1) 
In the classical notation (8.72) takes the expressive form 
d Q 
5) Soo nae = | 5 fl sade, 


Proof. Suppose that 
| fos max 


exists, and take any y in [c, d]. Then, by theorem 6.84, 


fox, )- Fx, 1) = | : Dmnaa fx, t) at 


for each x EJ; and, by theorem 8.71, the right-hand side defines a 
continuous function on J. Since f(., 7) is integrable over J, so there- 
fore is f(., y); in other words, (8.71) exists for all y in [c, d]. 

By theorem 8.71, the function y on [c, d] given by 


V0) = | Das fls vax 


is continuous. Then, by theorem 8.48 (or theorem 8.31, corollary 1) 
and, again by theorem 6.84, we have, for c < y < d, 


[’ Y(t)dt = [ra Davi %, Dax = i} dec” Dinvi h(x, tat 


= { (fc, fl, Jide = 40)- 400. 
Finally, since & is continuous and therefore 
d [¥ d 
5 |. vOdt = YO). 
it follows that ¢’(y) exists and is equal to yy). | 


The results we have so far obtained in this section have analogues 
for functions defined by improper integrals (in R1). We shall state 
all theorems for functions ¢ given in the form 


80) = | fee ax; 
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other improper integrals are treated in the same way. The notion of 
uniform convergence once again occupies a key position. 7 


Theorem 8.73. If 
(i) the function f:[a, 00) x J — R'(J © R®) is continuous, and 


(ii) | f(x, y)dx converges uniformly for y €J, 
then the function ¢:J — R* given by 
6) = | fos vax 
is continuous on J. 


Proof. By (i) and theorem 8.71, each function ¢,(m = 1, 2, ...) 


defined by ee 
Pr(Y) - |’ f(x, y)dx (yeJ) 


is continuous on J; and by (ii) the sequence of functions ¢, con- 
verges uniformly to ¢. Hence ¢ is continuous on J. | 


Theorem 8.74. Suppose that the function f:[a, 0) x[c, d] > R* satis- 
fies the following conditions: 
(i) for every X > a, f is integrable over [a, X ixic. ai: 


d 
(ii) for every X > a, | f(x, y) dy exists nearly everywhere in [a, X]; 
c 


(iii) | f(x, y)dx exists and converges uniformly for ¢ < y < d. 


ae | : dy t f(x, y)dx and f dx [s (x, y)dy 


exist and are equal. 


Proof. By theorem 8.48, for every X > a, 
aes z d 
[laf fosxdx and J de] fos 2d 


exist and are equal. 
at+n 
Next, since the functions ¢, given by ¢,(y) =| f(x, y)dx are 
integrable over [c, d] and ¢,(y)} > ¢() = { f(x, y)dx uniformly for 


c < y < dz, it follows from theorem 5.22 that ¢ is integrable over 
[c, d]. To prove the theorem we show that 


fFae[Sto nd [a fps. 
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as X -> 00. Now, given e > 0, there is an X, such that 
sup <€ 


| flex, yy dex 
c<xy<d | JX 


for all A > 2, ren, for x >22,, 


[4 [re nax—[" ae fx, av! 


= i dy | fx y)dx—[" dy | fix y)dx 


< (d—c)e. | 


d fea) 
< [ay "fe, nde 
c xX 
Note that conditions (i) and (ii) are both satisfied when f is con- 
tinuous on [a, 00) x [c, d]. 


Theorem 8.75. (Differentiation under the integral sign.) Given the 
function f:[a, 00) x [c, d] > R1, suppose that 


@ $69) = | fos ye 


exists for some y in [c, d]; 
(ii) D,f exists and is continuous on [a, ©) x [c, d]; 


(iii) | D, f(x, y)dx converges uniformly for c < y < d. 


co 


Then | T(x, y)dx exists and converges uniformly forc < y < d, the 


function ¢ is differentiable and 
$0) =|" Dif. dx ( <y <a) 


As in theorem 8.72, the last identity may be written 


5 jf y)dx = | : aft y) dx. 


Proof. Suppose that | I(x, 9) dx exists. By (ii) and the first part 


x 
of theorem 8.72, { I(x, y) dx exists for all X > a and all y in [c, d]. 


In view of (i) and (ili), given ¢ > 0, there is an X, such that, 
whenever X, > X, > Xj, 
xX, 
| D, f(x, Hax 
xX, 


x; 
<6) fors-e tal 2a { Fix, Bar| =<. 
Xi 
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Now take any y in [c, d]. Then, by theorem 6.84 and theorem 8.48 
(or theorem 8.31, corollary 1), we have when Ke > Hy Fae 3 


[re Ses mae [ax |” Deft a 


y Xs 
{ dt | lp feeitidx 
U) XY 


< ly—yle < (d—c)é, 
and so 


>. X. 
{ I(x, y)dx | f(x, n)dx|+(d—c)e < (1+d—c)e. 
=, ee 


Therefore, by the general principle of uniform convergence (6.34), 


© 
a 


< 


f(x, y)dx exists and converges uniformly for c < y < d. 


By theorem 8.73, the function y on [c, d] given by 
vo) = | Daft, »)dv 


is continuous. Theorems 8.74 and 6.84 now show that, force < y < d, 


["voae = [ral Dp f(x, tdx = y dx" Daflx, tat 


= |" 1/9 -Fle, de = $0)- $00. 
But, since y is continuous, 
< [yd = Wo) 
dy Je : 
Therefore $’(y) exists and is equal to Y(y). | 


The last three theorems invite comparison with theorems 5.21—5.23 
which describe the limit functions of uniformly convergent sequences. 


Example. For y 2 9, 


2 


(i) y > 0. Comparing the integrand with e-*” we see immediately 
that the integral exists. 
We have 9 ( 


ay 


I, ew = dx =~ —arc tan y. (8.73) 


= 02) = —e*Y sin x. (8.74) 
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Also the function defined by (8.74) is continuous on R? and, by the 

M-test (6.35), = 

| (—e-* sin x) dx 
0 


converges uniformly on any interval [c, d] with c > 0. Hence, by 
theorem 8.75, if 4(y) is the integral in (8.73), 
1 


Pe) = -{' eV sin xdx = {ep 
for c < y < d. Thus there is a constant A such that 
d(y) = A—arc tany (8.75) 


when yeé[c, d]. But [c, d] is an arbitrary interval such that c > 0 
and so (8.75) must, in fact, hold for all y > 0. 
To determine A we note that 


o0)| < | ewvde = ©. 


Therefore, letting y > in (8.75), we obtain 0 = A—41, i.e. (8.73) 
holds for y > 0. 

(ii) The fact that (8.73) holds for y > 0 follows at once from the 
example at the end of §6.9 where it was shown that 


ees 1 
I. BX dx = 7 (8.76) 


But we can also use the methods of the present chapter to evaluate 
this integral. 

We first prove that the integral in (8.73) converges uniformly for 
y 2 0. For x > 0, e~*”/x is positive and decreasing in x. Hence, by 
the second mean value theorem (6.93), 

= sk | eee 
in e Tene < x x, 

when X, > X, > 0 and y > 0. Since 2/¥,>0 as X, +0, the 
general principle of uniform convergence shows that the integral 
defining ¢(y) converges uniformly for y > 0. 

We now know from theorem 8.73 that ¢ is continuous on [0, 00). 
Therefore, by (i), 


~(0) = be 90) ce = ede tan y) = 47, 
1.e. (8.76) holds. 


3 
{ sin xdx| < 
: Fi 
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Exercises 8(g) 


1. Suppose that the function f: [a,0)xJ>R'VUS R") is continuous and non- 
negative, that 


(ee) 
60) = [ "fx, ode 
a 
c 
exists for all y ¢J and that ¢ is continuous on J. Prove that [ f(x, y)dx con- 
a 
verges uniformly for y € J. (Cf. Dini’s theorem 5.35.) 


2. Let I be the closed interval [a, b] x [c, d] in R® and let the function f: J> R* 
be continuous. For (x, y)¢I°, put J(x, y) = [a, x] x[c, y]. Prove that the 
function F: I° > R! defined by 


F(x, y) = { 
J(z,y) 
is such that, for all (x, El’, 
D, D,F(x, y) = Dz Di F(x, y) = f(x, y). 
(Cf. exercise 8(d), 8.) 


3. Let I, J be closed intervals in R” and R” respectively. Suppose that the 
function f: 1x J > R' is such that 


i < [ rae 


exists for some y € J, and the partial derivatives Dniif, ..-» Dinin f exist and are 
continuous on Jx J. Prove that ¢(y) exists for all y € J, that 


Dan = [, ae See es ay 2 ee ss 


and that ¢ is continuously differentiable on J°. 


4. Show, by an example, that in theorem 8.72 the condition that 


i; f(x, y)dx 


exists for some y in [c, d] may not be omitted. 


5, The function ¢ on R? is defined by 

40 

(x) = { cos (x sin t) dt. 

0 

Show that y = A(x) is a solution of Bessel’s differential equation of order 0: 
d*y dy = 
Xia an == 0; 
6. The function ¢ on R!—{-—1, 1} is defined by 
d(a) = i log (1—2a cos x+ a?)dx. 


Prove that ¢’(a) = 0 and hence evaluate ¢(a) for |a| < 1 and for la] > 1. 
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7. The function f: [a, b] x [c, d] > R} is continuous and such that D,f exists and 
is continuous. Also the function g:[c, d] > [a, blis differentiable. Show that the 
function ¢: [c, d] > R' defined by 


gly) 
i= [ fC. vide 


is differentiable and that 
rae 
a 


) 
?(y) = [ : D. f(x, y)dx+e"(y)f{g(y), Y}- 
The function F on (—4z, 47) is given by 
F(y) = log (1+ tan x tan y)dx. 
0 


Prove that F’(y) = logsec y+y tan y and deduce that F(y) = y log sec y for 
—in<y < 4z. 


8. Let B be an open ball in R” and let the function f = (fj, ..., f,): B > R” be 
continuously differentiable. Prove that a necessary and sufficient condition for 


i f to be independent of path in B is that, in B, D:f; = D;f; G7 = 1, ..., 7). 
‘ | 
(See exercises 8(b), 7 and 8.) 


9. Prove that, if 0 < a < 5, then 


b CO oa) b 
[ dy [ e-dx and [ dx [ e~*vdy 
Ja 0 0 a 


exist and are equal. Hence evaluate 


CO »p—ax —bx 

ce =< € 

{ —_—_—_———~ dx, 
0 x 


(This integral may also be evaluated by the Frullani method of exercise 6 (i), 2.) 


10. Prove that 


1. ee) 1 co 1 1 
asta Siaas al eee 
[io |. l+x?+y Bee [, dx | i¢etp” 


exist and are equal. Deduce that 


7 arc tan (sin 0) 
eer ets i] = 2 
i: <in 8 dé = 47 log (1+ /2) 


11. Let ¢ on R? be defined by 
© /sin xy\? 
60) = [° (2) as 
0 x 


Show that $’(y) = 47 for y > 0, and hence evaluate ¢(y) for all y. 
Use a similar method (and the formula 4 sin? 6 = 3 sin @—sin 3@) to evaluate 


oe) 1 3 
F (2) 
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12. Show that = 
Py) = { e-* cosh xy dx 

0 
exists for all y and everywhere satisfies the differential equation ¢’(y) —4y¢(y) = 0. 
Hence evaluate ¢(y) for all y. 


13. The function ¢ on R? is defined by 


Gi XY 
#0) =| x +x)" 


Prove that, when y > 0, ’(y)—¢(y) = —42. Hence find ¢(y) for all y. 
14. The function ¢ on R? is defined by 


Jane { arc tan ax ane tan bx ~— 
0 x 
Prove that, for a > 0, b > 0, 
= dx 4a 
D, D800) = |° Graces bay ~ axb 
and d(a, b) = 4n{(at+ db) log (a+ b)—a log a—b log 5}. 


15. Suppose that the function /: [a, 00) x [b, 0) > R* is non-negative and, for 
every Y > 3B, e y y = 

{ dx { f(x,y) dy, { ay | f(x, y)dx 

a b b a 


exist and are equal. Show that, if one of the expressions 


z dx [40 y) dy, [Po [re y)adx 


exists, then so does the other and the two are equal. (Cf. theorems 4.73, 6.33.) 
Evaluate mm ar 
[ dx [, +e" dy. . 


NOTES ON CHAPTER 8 


§8.1. In chapters 10-14 we generally deal with paths, i.e. equivalence classes 
composed of piecewise continuously differentiable transits. The definition of 
equivalence which we use is best suited to such transits. In a context where 
differentiability properties are unimportant, we might call the transits ¢ : [a, b] > 
R", :[c, d]> R” equivalent if, simply, there exists a strictly increasing, con- 
tinuous function « such that a(a) = c, a(b) = dand ¢d = you. 

In the definition on p. 248 the supposition that ¢:[a, b] > R” (1 > 1) is 
continuous could be omitted. The expression _ 


sup 3 |6(t)- $a (x) 
9 i=1 


if finite, is then more appropriately called the variation of @ and denoted by 
V°(¢). However this extension of the concept of bounded variation does not lead 
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to an essentially new class of functions, for the proof of theorem 8.11 (in which 
continuity is not used) shows that (*) is finite if and only if the components of 
@ are of bounded variation. 

A space filling curve was first constructed by G. Peano (1890). The example 
in exercise 8(a), 11 is due to I. J. Schoenberg (Bull. American Math. Soc. 44 
(1938), 519) who simplified an earlier construction of Lebesgue’s. 


§8.4. Surfaces, like transits, are naturally grouped into equivalence classes. 
The manner in which this is done again depends to some extent on the circum- 
stances. For instance the two surfaces ~: G > R*, * : G* > R" (G, G* < R%, 
1 < m < n) could be taken to be equivalent if there exists a bijective, continuously 
differentiable function «: G—G* such that Je > 0 throughout G and ¥ = 
y*oa. The restriction on the functions to be continuously differentiable is an 
indication of the considerable difficulties with which the theory of surfaces is 
bedevilled. 

The phrase ‘nearly everywhere’ (meaning ‘except in a set of zero content’) 
corresponds to the phrase ‘almost everywhere’, used in the Lebesgue theory of 
measure and integration, which means ‘except in a set of zero measure’. (In the 
notes at the end of chapter 6 we defined one-dimensional zero measure; the 
generalization to R” involves no more than the replacement of length by volume.) 
Since a set of zero content necessarily has zero measure, ‘nearly everywhere’ 
implies ‘almost everywhere’; but the opposite implication is false. 


§8.6. Two results (theorem 8.31, corollary 3 and exercise 8(/), 14) have been 
labelled special cases of Green’s theorem. With the material at our disposal only 
a general description of the parent theorem is possible. Suppose that the trace 
of a path y in R? is the frontier of a bounded set E and that f= (f,,f,) is a 
function on E to R®. Green’s theorem states conditions under which 


[ f= [[ @h- Dd. 


© sin x 
— dx 
b= 2 


§8.7. The integral 


has now been evaluated in two different ways (in this section and in §6.9). A 
further method, depending on complex function theory, will be found in §14.1, 
example 2. G. H. Hardy made an entertaining comparison of a number of methods 
of calculating this integral, assigning marks for simplicity, elegance, etc. 
(Mathematical Gazette 5 (1909), 98-103 and 8 (1916), 301-3.) 


9 
FOURIER SERIES 


9.1. Trigonometric series 


This chapter contains salient facts about trigonometric series of the 
form =. 
4a)+ > (a, cosnx+b, sin nx), 
n=1 


which we shall sometimes write more shortly as 
x A,(). 
n=0 


The topic has intrinsic importance in the expression of the periodic 
phenomena which occur in all branches of natural science. Moreover 
it will give the reader valuable practice in applying principles and 
theorems from earlier chapters, notably 4 and 5. 

The history of the subject is of high interest and is outlined in a 
note at the end of the chapter. Repeatedly the settlement of a con- 
troversy or the resolution of a difficulty thrown up by trigonometric 
series has clarified the foundations of analysis or has engendered new 
ideas and methods. 

The central problem of the chapter is the validity of the formula, 
say for x in [—7, 7], 


F(x) = 4ayt+ s (a,, cos nx +b, sin nx). (9.11) 
n=1 


The coefficients a,,, b,, are real and the function fis from R* to R’. 
The sum of the series on the right of (9.11)—if it converges—has 
period 27, so it is convenient to suppose that f is defined for all 
values of x and that f(x+27) = f(x). (If an fis assigned for which 
f(a) + f(—7), then fcan be redefined at one or both of these values.) 

We shall always suppose that fis Riemann integrable ($6.7) over 
[= 71). 

Let us make the assumption that the series on the right of (9.11) 
converges uniformly in [—7, 7]. The convergence remains uniform 
when we multiply by either of the bounded functions cos nx or 
sin nx. We can then integrate term by term over [—7, 7] by theorem 
5.22. 
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Use the facts (the orthogonality of the cosine and sine functions) 


a sin mx cosnxdx =0 (all m,n), 


1 o 


TT 
| cos mx cos nx dx -| sinmx sinnxdx =0 (m+n), 


—7 or 


[’ cost nx dix = |” sin?nxdx=7 (n+ 0) 


i = 


and; ifm = 0, | : cos’ mx ok = 27. 


The result of integration term by term is - 


<'g 


Qn = — IO) cos.mx dx, =, = 2 IO) sin nx dx. (9.12) 
(Note the reason for the factor 4 in the term 4a, in 9.11.) 

We repeat that the passage from (9.11) to (9.12) is purely formal 
in default of some justification such as the uniform convergence of 
the series in (9.11). We can, however, starting with an integrable f, 
define the numbers a,,, b,, from it. 


Definition. The numbers a,,, b,, given by (9.12) are called the Fourier 
coefficients of f. The series (9.11) with these values of a,, b,, is called 
the Fourier series of f. 

From the periodicity of the integrands in (9.12), the interval 
[0, +27] for any 6 would serve to determine a,,, b,. 


Theorem 9.11. If we are given a trigonometric series 
c 
sagt = (a,, cos nx+f,, sin nx) 


and it converges uniformly in [6, 0+ 27], then it is the Fourier series of 
its sum. 


Proof. Let s(x) be the sum. We can integrate the uniformly con- 
vergent series term by term after multiplying by cos mx or sin nx. 
This proves that 


1 6+27 | 6427 
i ap s(x) Cas nx dx; ff, = =|. s(x) sin nx dx. | 


From the definition, the Fourier series of a given function is 
unique. The converse does not hold without restriction on the 
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function. If g(x) = f(x) except for a finite number of values of x 
(more generally, except in a set of content zero), the corresponding 
integrals (9.12) are equal and f, g have the same Fourier series. The 
next theorem shows that two different continuous functions cannot 
have the same Fourier series. The reader should note this fact but 
he may appreciate the proof better if he defers it until he has gained 
more familiarity with trigonometric series (e.g. from §9.2). 


Theorem 9.12. If the R-integrable functions g, and gz are continuous 
at c and g,(c) + g,(c), then the Fourier series of g, and g, are different. 


Proof. Suppose the conclusion false. Then all the Fourier coeffici- 
ents of f = g,—g, vanish, and f(c) + 0, say f(c) > 0. 
If t,, is any trigonometric polynomial 


m 
sagt py (a, cosnx+f, sin nx) 


and 7 is any number, then 
+27 
ce £0, (9.13) 


7] 
Since f is continuous at c, we can find A (with 0 < h < =), k such 
that f(x) >k >Oforc-h<x<cth. 
We now take f,, to be defined by 
tn(x) = {1+cos (x—c)—cos h}”. 
Then f,,(x) may be shown to be a trigonometric polynomial. It has 
the properties 
(i) t,{x) 2 1 tee ek = x ee, 
(ii) as m > ©, t_(x) > 00 uniformly for 
c—th <x <c+4h, 
(iii) |t,,(x)| <1 for ct+th<x<c—h+2n, 
Then, by (i) and (11), 
c+h c+th 
| (ie | slim > hk inf t,,(x) > 00 as mM>oO., 
c—h c—sh 


c—h+2n 
Also, by (iii), | fp 


cth 
is bounded. So, if m is large enough, 


c—h+27 
| fm 
—h 


cannot be 0, which contradicts (9.13). | 
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Corollary. If f is continuous and the Fourier series of f converges 
uniformly, its sum is f. 


Proof. If the sum is g, then from theorem 9.11 the series is the 
Fourier series of g. Now apply theorem 9.12. | 


Note on complex Fourier series. Some results are more elegant if 
expressed not in cosines and sines but in the equivalent complex 
exponentials. We have 


1 7 


2a J 


e™dx= 0 (7 
L {4 =.0). 

The formal Fourier series of f, in general complex and equal to 
gt+ih, say, is then 


| 
a 
joe 
I+ 
N 
~ 


cO co 


= 


| : —Inx 
where Cn = 5 > J(x) et™ dx. 


b 
(The integral of a complex function | {p(x) + ig(x)}dx is defined 
b b . 
to te | Be ore | goad) 


It is easy to verify that a necessary and sufficient condition for f 
to be real is that, if c, is «,+if,, then c_, is its conjugate «, —i/,,. 

It will be convenient to confine ourselves in this chapter to real 
functions and to cosines and sines. 


9.2. Some special series 


Before studying general theorems about Fourier series, it is 
desirable to make oneself familiar with particular examples. 


Example I. Prove that the Fourier series of the function which is equal 
io x* in[—7, a] is 


77 cos x cos 2x 
us 4{ - Rt SS (-D) 


COS NX 
mM ete 
2 eee e 


State the sum of the series for all values of x. 
— Deduce that 


9.2] SOME SPECIAL SERIES 303 


Solution. By definition of Fourier series, the constant term and 
the coefficients of cos nx (n > 1), sin nx are 


| Be ee Se al L {* 
= dx, -= x2cosnxdx, - x? sin nx dx. 
ec ne re i 


Since x? is an even function, the integral with sin mx is O for all n, 
and the cosine integral is most easily evaluated as twice the integral 
over [0, 77]. Integration by parts gives the series required. 

The series converges uniformly for all values of x by Weierstrass’s 
M-test (theorem 5.32). By theorem 9.12 (corollary), the sum for 
—m < x < mis x*. The sum outside this range is (x — 27)? in [7, 37], 
(x — 47)? in [37, 57] and so on (the parabolic arcs being seen more 
vividly in a diagram). 


Putting x = 7, we obtain the sum ¥ (1/n?). | 
1 


Example 2. Prove that the Fourier series of the function which is equal 
to x? in (0, 27) is 


a ~ s (5 cos nx zl sin nz) 


Calculate the sum of this series for x = 0. 
Solution. To obtain the series, we have only to work out the 
constant term and the coefficients of cos nx, sin nx. 


1 2m 


1 27 1 Qn 2 
~— | x*dx, —]| x*cosnxdx, -— x? sin nx dx. 
21 0 77 Jo 7 JO 


Outside the interval (0, 277) the function which has this Fourier 
series has, by periodicity, the value (x —2k7)? when 


2kn < x < 2k+4+1)7 


for every positive or negative integer k. The function may be given 
an arbitrary value for x = 2k7; whatever value is assigned, the 
function is discontinuous there. 

The only convergence theorems so far proved (9.11 and 9.12) say 
nothing about the possible sum of the Fourier series of a discon- 
tinuous function. Such theorems will follow in §9.4. 

This example shows a phenomenon which will be found later to 
be general. At a discontinuity x = c of f at which the right- and left- 
hand limits f(c+) and f(c—) exist, the Fourier series of f will, if it 
converges, have the sum 4{ f(c+)+/(c—)}. 
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Taking here c = 0, we have as the sum of the series 

47? 2 4 

ci. 
which is 27? from example 1. As x >0+, the function x? in (0, 27) 
tends to 0. As x > O-, the function (x+ 27)? in (—2z, 0) tends to 
4r*. 

Fourier cosine series and sine series. We saw in example 1 that, 
if fis even, i.e. f(—x) = f(x), its Fourier series contains cosines only. 
Similarly if fis odd, i.e. f(—x) = —/(x), the integrals for a, are all 
0 and we have sines only. 

If, then, f is defined in the interval [0, 7], there is a Fourier cosine 
series associated with the even function in [—7, 7] coinciding with f 
in [0, z]. Similarly, if fis defined in (0, 77) and we define 

£0) =f) = 9, 
there is a Fourier sine series associated with the corresponding odd 
function in [—7, 77]. 
Example 3. Let f(x) be defined in [0, 7] to be x in [0, 47) and m-—x 
in [47, 7]. Prove that the Fourier cosine series of f is 
m 2 (COS 2x , COS 6x | COS 10x 
fae P a 5° be 2 


Prove that the Fourier sine series of f is 


4 (= x sin 3x, sin 5x _ 
a\ i = > oS 

The only feature not present in Example 1 is the necessity to divide 
the range of integration into [0, 47], [47, 7] in calculating the Fourier 
coefficients. 

As in Example 1, the series are uniformly convergent to the 
appropriate f(x). 

Example 4. Prove that the series 
5 
n=1 
converges to 4(7m—x) for 0 < x < 27. 

Notes. (1) We have previously shown in the example on p. 118 
that this series converges for all values of x, and that the convergence 
is uniform for § 2x28 
for any fixed positive 0. 
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(2) Observe that we have not here mentioned the phrase Fourier 
series. We shall solve the example from first principles. 
Solution. Write : 


sin rx 
Sn(x) ae ieee 
r=1 7L 
: sin (n+4)x 
Then si(x) = cosrx = —44-— >" 
nb) = 279 sin dx 
“sin (n+4)t 
So s(x) = —4x4+] edt. 
nl) : 2 sin 4f 
Putting x = 7 in this, we have 
™sin (n+4)t 
Ge i OE 
gy 250 3 


By subtraction we have 
un) —5,(0) =|" 


sin (n+3)t 


dt. 
2 sin $t 


The right-hand side, integrated by parts, is 


| - cos (n+3)t [- ™cos (n+4)t cos $t 
2(n+4) sin $tlz Jc 4(n+4) sin? dt 
If 6 < x < 27-6, the modulus of this does not exceed (say) 


1 1 
(2n+1) sin 38° 2(2n+ 1) sin? 36° 


We have thus proved that the series 


converges to 4(7—x) for 0 < x < 27. Moreover the bound obtained 
for |s,(x)—3(7—x)| shows that the convergence is uniform for 
6 <x < 2m—6. (This result had previously been deduced from 
theorem 5.33.) The sum of the series (obtained by periodicity out- 
side (0, 27r)) is discontinuous at 0, + 27.... 

It is easy to verify that the series is the Fourier series of its sum, 
as defined by (9.12). 

We add one further property of the series which will be used later 
(in theorem 9.51), namely the existence of a constant B such that 


|S»(x)| < B for all x, n. 


In proving this we may assume 0 < x < 7. 
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Let m = min (, [7/x]), where [a] means the integral part of a. 
Then 


5,(x) = (E+ 3 pe = t,+1,, say, 


)= sin rx 
mt+i1 


where the sum /, is empty, and so zero, if m = n. Then, using the 
inequality sin u < u valid for u > 0, we have 


ik exe = Rie Se 
: 
Also, from example (p. 91) and the inequality sin u>2u/7 (0<u<4m) 


ll < pa See SE 
a ™ (m+ 1) sin ix ~ (m+Dx- ~ 


Exercises 9(a) 


1. Calculate the Fourier series of |x| for —7 < x < 7. What function does the 
series represent for other ranges of x? 
Sum the series = 1 


~ (2n+ 1)?" 


2. Prove that, if a + 0, the Fourier series of e* in (—7, 7) is 


2 = ant (5+ > (— 1)" (acos nx—n sin - 


4a. Sax n* +a? 


3. Prove that, if —7 < x < 7 and ais not an integer, 


2asinatm(1 cosx cos2x 
cos ax = ———_ —., 


1 2a® @—1. a—4 
Hence express cot a7 as an infinite series of partial fractions. (This result will 
be examined more generally, for complex a, in § 14.3.) 
4. Prove that, for a range of x to be specified, 


fe @) 
es = x cos 2nx 


T n= 42-1" 


5. Expand cos bx in a sine series for 0 < x < 7, taking b to be (i) not an integer, 
(ii) an integer. 


6. Prove that the function f defined by 
(0) =40a—-a) for —-a 
f(@) =40(7-0) for ax 


is represented in [—a@, 27°—.«] by the series 


IN 
x 
A 
> 


< 
6 


7] 
< 27-a 


> sin nO sin na 


n=1 n* 
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What locus in R? is represented by 


> sin mx sin ny 


= 07 
n=1 n° 
7. Prove that the following three expressions all represent x(/—x)forO< x <1: 


8/2? 2 sin (2n—1)(7x/l) . 21? 2 sin? ge 


T no4 (2n—1)8 > amt n* 
Pf Pi 606 (2nmx/1) 
6 a1 n® 


8. Prove that the function /*— x? has, in the range [—/, /], the Fourier series 


an {2-2 =% = = 05 


9. The function fis defined in the range [0, 7] as 


f(x) = 3x when 0< x <7, 
f(x) = 40 when 47 < x < 37, 
f(x) = Xa—x) when 37 <x <7. 
poet Wa= 6 2 sin sin $(2n— 1)7 sin (Qn— Ix 
ot ge 1¥ 
10. We proved in §9.2, example 4, that 
>= sin rx eee 
1 


7 sin t pam ft —. 
Sharpen the bound to = dt+1. The exact supremum is as dt; this is 
0 


0 
harder to prove (Gronwall, Math. Annalen 72 (1912), 228-61). Prove, for the 
analogous integral, that, for all 7, x, 


9.3. Theorems of Riemann. Dirichlet’s integral 


The first theorem of Riemann (extended later by Lebesgue) 
includes the result that the Fourier coefficients a,,b, of an R- 
integrable function tend to 0 as n tends to infinity. 


Theorem 9.31. (Riemann-—Lebesgue.) If f is R-integrable over [a, b}, 
then 


| - Aaa cd a en eds 


tend to 0 as A tends to infinity. 


Proof. It is sufficient to give the proof for the cosine integral. 
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Given ¢, take a dissection 


G2 x, * He = X, = 3B, 
n 
such that > (M,. —m,)(%;— Xp-1) < 3 
r=1 


where M,, m, are the supremum and infimum of fin [x,_,, x,]. 
Define g by putting g(x) = m, for x, < x < x,. Then 


b b b 
| F(x) cos Axdx -| { f(x) — g(x)} cos Axdx + | g(x) cos Axdx. 


The first integral on the right is numerically at most 


n 


[-\7e)-geolar < 3 Ot, -m)@,-ma) < 6 


The second integral is 
> m, (sin Ax, —sin Ax,_1)/A 
r=1 


which is numerically less than or equal to 
2(%|m,|)/A 
and this is less than € if A > A,(e). | 
Taking A = n and [a, b] = [—7, 7] we have the result for Fourier 
coefficients. Without restricting f it is not possible to make any 
assertion about the rapidity with which a,, b,, tend to 0. As a speci- 


men of an improved result under an additional assumption about /, 
we quote the following theorem. 


Theorem 9.32. If f is of bounded variation in |—7, 7], then, as n > ©, 


aio rods =U): 
n n 


Proof. From theorem 6.43, we may suppose that f increases in 
[—7, 7]. By the corollary to the second mean value theorem 6.93, 


iz f(x) cos nxdx = f(-m |" cos nxd+-fa)| "cos nx dx 


9.3] THEOREMS OF RIEMANN AND DIRICHLET 309 


Theorem 9.33. (Dirichlet’s integral.) The n-th partial sum 


Sn = S,(X) = 4a,+ > (a, cos rx+ 6, sin rx) 
r=1 
of the Fourier series of f at x is 
af) Dylt—2f(0 dt, 
where the Dirichlet kernel D,(u) is defined by 
sin (n+4)u 


Proof. Note that D,,(u) may be left undefined at u = 0 or may be 
defined as its limit n+. 
Forr.=0as se 


a, cos rx +b, sin rx 
= (- | : S(t) cos rtdt) cos rx + al JH sin rtdt) sin rx 


“ =f cos r(t—x) f(t) dt. 


Substituting in s,, we have 


S, = | f +3 cos nt} £0 dt 
=F 1 
which leads to the result. | 


There are easy variants of this formula for s,,; it is to be borne in 
mind that fand D,, have period 27 and so any range of integration 
of length 27 will serve. We have then 


Six) = : 3 D,(u) f(x + u) du. 


Putting in this formula —wu for u in (—7, 0), we have 


sa(3) = 2” Daud {flo +4) + fle— a} du 
We note also that 


a D,(u) du 


Il 
RS) 
a 
die 
~f- 
HM 3 
QO 
eo) 
~” 
oe 
ed 
Ea 
> 
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We can now state as a separate result one of the most useful 
conditions for convergence of a Fourier series. 


Theorem 9.34. A necessary and sufficient condition that s,(x) con- 
verges to S asin —> © is that 


[D.C oe a 
0 
tends to 0 as n tends to co, where 
$(u) = $(x, u) = f(xt+u)t+f(x—u)—2s. 
Proof. From the immediately preceding work the integral is 


7(Sn(X) ts S). 


Theorem 9.35. (Riemann’s localization theorem.) The convergence 
or divergence of the Fourier series of f at x depends only on the values 
of f in an arbitrarily small neighbourhood (x —6, x +6) of x. 


Proof. From the remark following theorem 9.33 we have 


) 7 
sa) = 2 (f +f") DaGd Mer te) + flew} du 
In [6,7], {f(x+u)+f(x—1u)} cosec 4u being the product of two 
integrable functions is integrable. By the Riemann—Lebesgue theorem 
9.31, the term A in the formula for s,(x) tends to 0 as m tends to o. 
F ) 


So s,(x) and the integral from 0 to 6 in the formula for it both 
converge to the same limit as m tends to infinity or both diverge. 
The integral involves only the values of fin (x— 6, x+6). | 


We close this section with a modification of theorem 9.34 which 
is often easier to apply. 


Theorem 9.36. A necessary and sufficient condition that s,(x) con- 
verges to s is that, for some 6 (where 0 < 6 < 7), 


[, sin (n+4)u—— we du 


tends to 0 as n tends to oo. 


Proof. 


sin(nt+})u = 
——— = sin (n+ 3)uf — -;| 


2sinZu u 


D n(U) ag 
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and the expression in curly brackets, being O(u) as u > 0, is integrable 
over [0, 6]. Hence, by the Riemann—Lebesgue theorem 9.31, 


3 dnenale du —[ DB deads 


u 


tends to 0 as n tends to infinity. 
The proof of theorem 9.35 shows that the condition stated in 
the present theorem is equivalent with that of theorem 9.34. | 


9.4. Convergence of Fourier series 


Theorems 9.34 and 9.36 can be transformed into a variety of 
shapes; two are given in theorems 9.41 and 9.42. The first form is 
very easily deduced from theorem 9.36; the second also is in common 
use. 

Since the value of f can be changed arbitrarily at a point without 
alteration of the Fourier coefficients, any condition of convergence 
of s,(x) to f(x) must take account of values of f not only at x but 
in a neighbourhood of x. (Continuity of f at x would be such a 
condition: we shall see in theorem 9.51 that it is not, in fact, strong 
enough to entail convergence.) 


Theorem 9.41. (Dini.) A sufficient condition for the Fourier series 
of f to converge to s at x is that the integral 


(rela 
0 Uu 


7 
exists (in the sense of lim : 
7 


h> 


Proof. Given e, choose 7 such that 


{ PO)! 4, — 
0 4& 
Then, for all n, 


[ sin (1 + jul a, 
0 u 


< [Ola << 


By the Riemann—Lebesgue theorem 9.31, there is m,(€) such that 


[sin +322 


Se f- 3 Ny 


These two inequalities combine to give the sufficient condition of 
theorem 9.36. | 
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Corollary. If f is differentiable at the point x, then the Fourier series 
of f converges at x to f(x). The conclusion remains true under the more 
general hypothesis that, for some « > 0, 


f(x +h)—-f(x)| = O(h|%) 


as h tends to 0. 
Proof. With ¢(u) = f(x+u)4+f(x—u)—2f(x), we have as u > 0, 
}P(u)/u| = O(|ul*-*) 


and the integral in the theorem exists. | 


Theorem 9.42. (Jordan.) If f is of bounded variation in a neighbour- 
hood of x, then the Fourier series of f converges at x to 


Bf (x+) +f(x—)}. 
Proof. In theorem 9.36 write 
s= Hf(x+)+f/-)}. 


(These limits exist by theorem 6.43.) 

Then ¢(u) has bounded variation in an interval [0, 6] say, and ¢(u) 
tends to 0 as u tends to 0. 

By theorem 6.43 ¢(u) may be expressed as the difference of two 
increasing functions, each of which tends to 0 as u +0. 

The theorem will follow if we prove that, if d(u) is increasing, the 
integral in theorem 9.36 tends to 0 asm > o. 

Given ¢, choose 7 so that (7) < e. By the second mean value 
theorem 6.93 there exists a number & such that 0 < & < 9 and 


| "sin (n+ 2)u d(u) du = $(y) I’ ——S du 


0 Uu 
mr 
= sa) | = du, 


where a = (n +4), b = (n+4)7. This last integral is bounded for all 
n, &, 7, and so the last line is numerically less than Ae. 
By theorem 9.31, there exists n) such that 


f= OY guna 


So the integral in theorem 9.36 is numerically less than (A + 1)e if 
A> yt 


<¢- oho 
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The Gibbs phenomenon. Suppose that the Fourier series of a 
particular function f, which has discontinuities, is known to converge 
to f. We shall probe the manner of approach of the continuous curve 
y = s,(x) to the limit curve y = f(x), as 7 tends to infinity, in the 
vicinity of a discontinuity of f. The characteristic behaviour is seen 
most clearly if we make the simplest possible choice of /, namely 


f(x) = -40 if -7< x < JO, 
I = ee x <2, 
fhas period 27 and f(n7) = 0 for all n. 
The Fourier series of f, which is found to be 
2(sin x +4 sin 3x+4 sin 5x+...), 


converges to f(x) for all values of x, by theorem 9.42. 

It will appear in theorem 9.43 that, if n is large, as x increases from 
the discontinuity 0, the curve y = s,(x) rises at a steep gradient to a 
maximum value b,, at x = 7/2n. We shall prove that 5, is approxi- 
mately, not 47 as might be expected, but the number 


™ sin ¢ 
| ae 
qo 


which is about 18 per cent greater than 47. This phenomenon of 
‘overshooting the mark’ is named after Gibbs (see a historical note 
at the end of the chapter). As x increases beyond the value 7/2n, the 
sum s,(x) will have a sequence of minima and maxima alternately 
below and above the line y = 47. Since s,(x) = s,(7— x), the curve 
y = 5,(x) for 47 < x < 7 is the mirror image in the line x = 37 of 
the curve for 0 < x < $7. 


Theorem 9:43. With the notation of the last two paragraphs, 


™sin t 
rene cae { ames i 
no 0 f 


Proof. 
s(x) = 2(cos x+cos 3x+ ... +cos (2n— 1)x) 


= sin 2nx 
sinx - 


Therefore S,(X) =f" av: mel 
9 sint 
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S,(x) has maxima or minima when sin 2nx = 0, that is, when 
2nx = rm. Insertion of these values in s;,(x) shows that odd values 
of r give maxima and even values minima. 

At the first positive maximum x = 7/2n, 


er Il in Et 


+-—sin—+... + sin 


Sax) = 2| sin an 3 On an—1 on 


n 


_ m{sin (7/2n) , sin (37/2n) sin (2n—1)7/2n 
RK 7/2n 37/2n —— (2n—1)z/2n 


This is seen to be an approximative sum to the required integral if 
we dissect [0, 7] into m equal parts and take the value of (sin f)/t at 
the mid-point of each part. | 


The reader will expect the Gibbs phenomenon at a discontinuity 
to be shown by the Fourier series of a wide class of functions. In 
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order to define such a class we need further knowledge about the 
Fourier series of continuous functions. This will be found in the next 
section, in particular in theorem 9.54; see, then, exercise 9(c), 2. 


Exercises 9(b) 
1. Prove that the function defined by ; 
f(x) = 1/log @|x|-) @& + 0), 
f0) =90 


satisfies at x = 0 the condition of theorem 9.42 (Jordan) but not that of theorem 
9.41 (Dini). 


2. Construct an example in which Dini’s condition is satisfied but not Jordan’s. 


9.5. Divergence of Fourier series 


Theorem 9.51. The Fourier series of a continuous function may 
diverge at a point. 


Proof. Define 


nm sin rx 
t(x, m,n) = 2sin mx > 
r=1 


(m>n> 0). 


We proved in §9.2, example 4, that the sum on the right-hand side 
is bounded for all x, 7. 

Hence t(x, m, n) is bounded for all x, m, n. 

Now ?t(x, m, n) is the cosine polynomial 


Cos (m—n)x , Cos (m—nt1)x , sing OS (m—1)x 
n n—1 ] 


— eee 


1 5 ae = 


Although t(x, m,n) is bounded, nevertheless, at x = 0, the sum 
of the first 7 terms (or the last 7) is numerically greater than log n. 

Take two sequences of positive integers m, and n, withn, < m,, and 
a convergent series = a, of positive terms. Then, from the bounded- 
ness of t(x, m, n), the series 


cos(m+1)x cos(m+2)x _ cos (m+n)x 


ye a, t(x, Myy Ny) (9.51) 
r=1 


converges uniformly for all x to an even continuous function, say 


f(x). 


316 FOURIER SERIES [9.5 


If, for all r, M,+Np < Mpa —Mpars (9.52) 


then the multiples of x occurring in ¢(x, m,,n,) and t(X, M,41, Mp41) 
are all different. 

The series (9.51) is of the form = A, cos vx. Since the sequence of 
its partial sums, when the terms are bracketed as in (9.51), converges 
uniformly, we can multiply by cos vx and integrate term by term 
over [—7, 7]. 

Every integral is 0 except that of the vth term where 

M,—Ny SVS M,+N, 
and we have * 
Oe f(x) cos vx dx. 


The series & A, cos vx is the Fourier series of f and its partial sums 
satisfy the inequality 
1 


| ] 
Sim {0) = a,(1+5+ oe +2) > a, log n,. 


If m, = 2n, and n,,, > 3n, then (9.52) is satisfied. We have 
Sm (0) > 3a, log m, 


and the theorem will be proved if a, log m, tends to infinity with r. 
To ensure this we may take a, = 1/r*? and an even 


m, > max (3m,_, exp r°). | 
The above construction, due to Fejér, defines a continuous 
function whose Fourier series diverges for a specified x. Continuous 


functions can be constructed whose Fourier series diverge for 
infinitely many x. 


Theorem 9.52. Ata point of continuity x of f, we have s,(x) = o(log n). 
This statement is the best possible. 


Proof. With the notation of theorem 9.36, we shall prove 
a sin (n+ yu ay = O(log n). 
0 


Take s = f(x); then é(u) > 0 as u > 0. Given «, there is 77 (less than 1) 
such that |d(u)| < eforO < u < 7. Choose n greater than 1/7. 
Divide the range of integration [0, 7] into the parts 
[0, 1/n], [1/n, 9], [9, 7] 


and call the corresponding integrals /,, J,, J3. 
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In J,, sin (n+4)u < (n+4)u and so 


IL] < n+3)[ [6D | a ‘ (1+5,] . 


g 
Also Ze} < e| ee e log n, 
1/n U 


I, = o(1) by theorem 9.31. 


Adding, we have s,(x) = o(log n). 

To say that this limitation on s,(x) is the best possible means that, 
however slowly y(n) decreases to 0, there is a function f for which, 
at some point of continuity x, s,,(x) is not o{y/(n) log n}. 

This is already contained in the construction of theorem 9.51. 
We have only to impose the additional condition on m, that 
y(m,) < 1}r?. 

Then s,,(0) > 3a, log m, > 34¥(m,) log m,. | 


Fourier series, even if divergent, are not useless. Surprisingly, any 
Fourier series can be integrated term by term. 


Theorem 9.53. If a,, b,, are the Fourier coefficients of f, then 
16 he a, sin = —COS NX) _ [70 ‘ 


Proof. Write ‘ 
ies | “f)dt—Sagx, 


Then F is continuous and of bounded variation. Also, since 


F(x + 27) — F(x) = "#0 dt—ay7 = 0, 


F has period 27. 
By theorem 9.42, the Fourier series of F converges to F(x) for all x. 
The Fourier coefficients A,, B, of F are 


27 
An =| F(x) cos nx dx 
7 J 0 


snux. 1 fe ; | 
= | =|, { f(x) — 4a} sin nx dx 


= 2 Fe) 


= 2 [P09 sin mde = = 58 
= ur} o Xx) SIN NXAaAX = ‘ 
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and 
27 
B, = =| F(x) sin nxdx 
TJ 0 
sa oe nd eee ae Se g 
= 7|-Fe) |, +f { f(x) —4ay} cosnxdx 
1 2a An 
a al, f(x) cos nxdx = as 
So ee A, SID — COs nx 
n=1 
Putting x = 0, we have 0 f 
Lda = 37. 


Corollary 1. The series X (b,,/n) converges. 


Corollary 2. An everywhere convergent trigonometric series need not 
be a Fourier series. 


Proof. By the example following theorem 4.54 (p. 91) 


= sn WX 


converges for all x. If this were a Fourier series, then, by corollary 1, 


= 1 
n=2N log n 


would converge, which is false. | 


Theorem 9.54. In theorem 9.53 the series on the left-hand side 
converges uniformly for all x. 


Proof. Write B,(x) = 6, cos nx—a,, sin nx. 

We shall show that B,,(x) is the coefficient of sin nt in the Fourier 
series in ¢t of f(x+2). 

We have 


i: S(x+90) sin ntdt f(t) sin n(t— x) dt 


-| f(t) sin n(t — x) dt 


= COS nx |" f(t) sin nt dt — sin nx | ; f(t) cosnt dt. 
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Dividing by 7, we have the result. (Similarly | 
A, (x) = a, sin nx+b, cos nx 


is the coefficient of cos nt in the Fourier series in ¢t of f(x + 2).) 
Now we have to prove that = B,,(x)/n converges uniformly. 
From the first part of the proof, 


sin nt 
dt 


n 


5 


ad (+h + Is), 
where the 7’s are the integrals taken over [0, 4], [3, 27 — 4], [27 — 4, 27]. 
We know that, for all x, f, 
If(x+0| < A, 
and, from example 4 of §9.2, for all 7, s, ¢, 


§ sin nt 
2 
oS 


< B. 


Hence each of |J,|, |J3| is at most ABO, and is less than a given ¢ 
by choice of 6. Now = (sin nt/n) converges uniformly for 


6<t < 27-6. 


Therefore we can find r, such that 


S$ sin nt 
2 
- un 


fors > r > r,and all t in [6, 27-6]. We have then |J,| < 27Ae and 


SB, (x) 
- n 


f 


<2e(7+4). | 
7 


Exercises 9(c) 
1. Deduce from theorem 9.53 an alternative proof that two different continuous 
functions cannot have the same Fourier series (cf. theorem 9.12). 


2. Prove that the Gibbs phenomenon holds at x = 0 for the function g = f, +f, 


where x 
Fix) = { f(t)dt, f being any integrable function, 
0 


f(x) = —kin(-—7,0), kin(0,7) (k > 0). 
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9.6. Cesaro and Abel summability of series 
Consider the infinite series 


1-1+1-14+1-.... 
If 's,, is the sum of the first n terms, 
S, = 1 (nodd), Ss, = 0: -@reven). 
The average (arithmetic mean) of the sums 5,, is 


oe Sy+ eee “+ d,, 
n 


n 


and c= 5(1 +) (n odd), c, = : (n even). 
Thus o,, tends to 4 as n tends to 0. 

In the passage from s,, to o,, oscillations have been ironed out by 
cancellation of positive and negative terms of the original series. 
With increasing n the trigonometric functions sin nx and cos nx go 
through positive and negative values, and it is plausible that averag- 
ing the sums of a divergent trigonometric series may result in a 
convergent sequence. 

This section will deal with series of constant terms. In §9.7 we 
shall discuss trigonometric series. 


Definition. Let s, = Uj+Ugt+ ... tu, and 


ae Sy +So+ eco + Sy 


= n 


Then o.,, is the (C,1) mean of the s, and, if o,, tends to l as n tends to 
‘infinity, we say that 5 u,, is summable (C, 1) fo /. 

n=1 
C refers to Cesaro (1859-1906). More generally a (C, &) mean can 


be defined; if k is a positive integer it is essentially a k-times repeated 
average. We shall confine ourselves to the case k = 1. 


Theorem 9.61. If & u,, converges, it is summable (C, 1) to the same 
sum, 

We omit the proof which is a straightforward exercise on limits (see 
exercise 4(b), 9(1)). 


Theorem 9.62. If Xu, is summable (C,1), then s, = o(n) and 
u, = O(n). 
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Proof. The first conclusion follows from 
S, = no, —(n—1)o,,-1, 
and then the second from 
ee eet ee 


Theorem 9.62, a limitation on the magnitude of u,, states that if 
the terms of a series are too large, the Cesaro averaging is too weak 
to sum it. 

It is much less clear that if the terms of a series are too small, the 
(C, 1) method will not serve any useful purpose—it will apply only 
to convergent series. The next theorem states a limitation of magni- 
tude of u,, under which summability (C, 1) of u,, implies convergence. 
It will not be needed in the rest of the book, but it is an excellent 
exercise in analysis. 


Theorem 9.63. (Hardy, 1910.) If u, = O(1/n) and Xu,, is summable 
(C, 1), then it converges. 


Proof. 

(a) We separate out two preliminary results (5), (c), in which the 
hypothesis u,, = O(1/n) is not used. 

(b) Let t, = u,+2u,+ ... +nu,. A necessary and sufficient con- 
dition that the series Xu,,, summable (C, 1), should converge is that 
p= 07). 

This follows from t, = (n+1)s,—n0,. 

(c) A necessary and sufficient condition that 2w, should be 
summable (C, 1) is the convergence of 


ln 
Lint i¢ 


This follows from 
a n n—| 
Hei)” nti OO 
(d) Using (b), (c), we prove the theorem. Suppose that Xu, does 
not converge. From (bd), there is a positive A such that either 
ty > AN or ty < —AN 
(assume the former) for infinitely many N. 
Now, for any 7, 
tnar = tn t(nt unas > t—8B, 
since u,, = O(1/n). So, if ty > AN, we have ty,, > AN— Br. Then | 
tvir > 3AN if r < 4AN/B. 
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Therefore 
N+r t N+P 1 r+ 1 
n 1 eS eee 
+ n(nt+l)~ ee = nn+1) * N+r+1 
If we take r to be the integer next less than 44.N/B, we have 
N+r i. C 
“ n(n +1) oe 


where C depends on A and B but not on N. Hence the series 
xt,/n(n+ 1) diverges and by (c), Xu, is not summable (C, 1). | 


There are many methods of ‘summing’ series besides that of taking 
arithmetic means. If the reader will turn back to theorem 5.47 (Abel’s 
limit theorem) he will realize that the damping factors r”(r < 1) 
suggest another method. 


Definition. If, asr > 1-, 


fe a) 
YY u,r” > 1, 
n=0 
foe) 


then we say that >) u,, is summable (A) to /. 


Theorem 587 dao that every convergent series is summable (A). 
Illustrations. The series 
1—-1+1-1+... and 1-—2+3-4+4+5-... 
are summable (A) to 4 and } respectively. 
_ We shall prove that (A) is a more powerful method of summation 
than (C, 1); that is to say, any series which is summable (C, 1) is 


summable (A). Moreover there are series which are summable (A) 
but not summable (C, 1); such a series is 


1—2+3-4+4+5-..., 
which has just been mentioned. 


Theorem 9.64, A series which is summable (C, 1) is summable (A) to 
the same sum. 


Notation. We shall take the series to be }) u,,, starting with a term 
0 


Uy, aS is Appropriate to summability (A). This entails a modification 
of the formulae for (C, 1) sums at the beginning of §9.6, where it was 
preferable to start with u,. We write henceforward 

a Te ee 


5, = 0% and. ¢, =... 
0 n 
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Proof. By theorem 9.62, u, = o(m) and so 2u,x” converges 
absolutely for 0 < x < 1. 

By two multiplications of absolutely convergent series we have 
(C1, theorem 5.7, corollary) 


(—x)?* 3 ux” =U -x)>| 022 ut u,"| 
0 0 0 
ms - sg 5: i.° = Done, x. 
0 0 1 
Therefore (supposing throughout that 0 < x < 1), 
8X Sh we ye, xX. 
0 1 


The binomial expansion of (1 —x)-? (or the substitution 


=0 (@ 21) 
in the last-line) gives 
Leif ~ x)? Enxt-, 
1 


Multiplying this equation by / and subtracting from the one before, 

we have = 0 

Yu,x°-l = 1-xP Dn(e,—-) x. 

0 1 

Suppose that / is the (C, 1) sum of > u,. Then, given e > 0, there 
0 


is an N such that |o,,—/| < eifn > N. Now 


Su, Pet aay Ss =x (2+ | +> » ) mo —])xn-1, 


In this, > n(o,,—1) x 
N+1 


<e > nee? < ef] xy 4. 
N+1 
Keeping WN fixed, and — x > 1-, we have 


x"-—I| < «€. 


xz—1— 


Since ¢ is arbitrary, u,, is summable (A) to /. | 


9.7. Summability of Fourier series 


Theorems 9.51 and 9.52 are confessions of failure to determine a 
function as the limit of partial sums of its Fourier series. To a 
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physicist this would seem lamentable; the results of his observations 
in spectral analysis are the Fourier coefficients of the functions that 
he seeks. We are now in a position to show that, though the sequence 
of sums s,(x) may fail, the sequence of means @,,(x) succeeds in 
reconstructing a continuous /. This result of Fejér was a landmark 
in the theory. 


Definition. 
$00) Fn. FAD 
: : 


o,(x) = 
Note that the first suffix is 0, not 1. 


Theorem 9.71, For the Fourier series of f, 
aa(x) =F |” Kyle flew) + flew} 


where the Fejér kernel K,(u) = sin? 4nu/2n sin? 4u 
Proof. mee the Dirichlet integral (theorem 9.33) for s,, we have 
n—1 sin (r+3)u 


74x) == [15 


r=o 2Sin 4u 


| {fGe+4) +f(x—u)}du. (9.71) 


Sum the trigonometric series by the formula 


2 sin 4u sin (r+4)u = cosru—cos(r+l1)u. | 


Corollary 1. , K,(u)du = 37. 


The important property of the Fejér kernel K is that it is always 
positive (or zero), in contrast with the Dirichlet kernel which takes 
both signs. This fact yields wea such as the following: 


if < yu) < 
then iam alk K,(u) Y(u)du < 47M 


The following corollary is analogous to theorem 9.34. 


Corollary 2. A necessary and sufficient condition that the Fourier series 
of f is summable (C, 1) to s at x is that 

1 (7 sin? 5nu 

aan Jo Sha 


where ¢(u) = f(x+u)+f(x—u) —2s. 


g(u)du = 0, 
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Theorem 9.72. (Fejér, 1904.) The Fourier series of f is summable 
(C, 1) to Hf(x+)+f(x—)} whenever this expression has a meaning, 
and in particular to f(x) at every point x of continuity. 


Proof. Putting s = 1{f(x+)+/(x—)}, we have d(u) > 0 asu>0. 
Given ¢, there is an y(< 7) such that |A(u)| < € for |u| < 7. 
Dividing the range of integration [0, 7] at 7,.we have 


e prsin® gnu 5 1 (7 |e) 
b(u)du) <7) as rep du+- | a, du. (9.72) 


I 7 sin? 4nu 
no sin? $u 


The first term on the right is less than 
é (7 sin? 
g { 4nu a 


n)jo sin? 4u 


which is ze. The second term tends to 0 as n tends to ©. | 


Corollary 1. If f is continuous in an interval (a,b) anda <c<d< b, 
then o,(x) f(x) uniformly in the sub-interval [c, d]. 


Proof. Given e, there is 7 independent of x such that, for all x in 
as foe+w)—fe)| < de if ul <1. 
Hence |¢(u)| < ¢ if |u| < 7, which is the estimate required for the 
first term on the right in (9.72). Also |¢(u)| < A, a constant inde- 


pendent of x, u, and the second term on the right of (9.72) is less 
than B/n. | 


Corollary 2. (Weierstrass’s approximation theorem for trigono- 
metric polynomials.) Jf f is continuous and has period 27, then given e, 
there is a trigonometric polynomial t(x) for which |f(x)— i << 
for all x. 

This is the analogue of theorem 5.52 for algebraic polynomials. 
See also exercise 5(f), 2. 


Summability (A) of Fourier series. 


Lemma. If 0 < r < 1, then 
222 a sues: 
2(1—2r cos 6+1r?)’ 


(ii) for fixed r, the series converges uniformly in 0. 


cO 
(i) 4+ Sr” cos nd = 
n=1 


Proof. (i) Writing z = re’’, we have : 


- 1 tre # 
- a VAL = —— — i — ee i 
“ - l—-z * 1—2rcos6+r2 ? 
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We now take real parts. 
(ii) |r” cos n6| < r” and Xr” converges. 
Uniformity follows from theorem 5.32 (M-test). | 


Theorem 9.73. If f(x) has Fourier series >, A,,(x), then, for0 < r < 1, 
0 
a | roost = FAG) 
In), 1—2r cos (t—x) +P Pe ahi sensei 
Proof. From the lemma, 
Sse 
2(1 — 2r cos (t— x) +7?) 
We can now multiply by the bounded function /(/) and integrate 
term by term over [—7, 7]. | 


= 44+ Yr" cos n(t—x). 
4 


The left-hand side in theorem 9.73 is called the Poisson integral 
of f. 
Theorem 9.74. 
i Big l-r 


iin Se a = eee 


wai 24) «lO Xe 


whenever the right-hand side has a meaning. In particular the right- 
hand side is f(x) at every point x of continuity. 


Proof. By theorem 9.72, >| A,(x) is summable (C, 1) to 
0 


Hf (x+) +f(x—)}. 
By theorem 9.64, it is summable (A). Theorem 9.74 then follows from 
» 


Exercises 9(d) 


1. The power series Xa,z” has radius of convergence R. Can it be summable 
(C, 1) for a value of z with |z| > R? 

For what values of z is the geometric series &z” summable (C, 1)? 

Define uniform summability (C, 1). Specify a set of z in which &z” is uniformly 
summable. 


2. Prove that the summability (A) of Xa, implies its convergence if 


either (i) a, > 0 for all x, 
or (ii) a, = o(1/n) (Tauber). 


(The much deeper result with O in place of o in (ii) was proved by Littlewood.) 
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3. Adapt the proof of theorem 9.64 to prove the following theorem. Suppose 
that (i) Xd, is a divergent series of positive terms, (ii) &=d,,x”" converges for 
|x| < 1, (iii) as n > ©, lim (c,/d,) = k (+0). Then, asx >1-, 


Zilli, X 


Sd, x" +k, 


4. Satisfy yourself that : 
x x 
lim : (1 ~ *) f(x)dx 


X= 


is a suitable definition of (C, 1) { f(x) dx. 
0 
Prove that { x® sin x dx exists in the (C, 1) sense if 0 < k < 1. 
0 


Can you suggest a definition of summability (A) of [ F(x)dx? 
0 


9.8. Mean square approximation. Parseval’s theorem 


We saw in §9.5 that, for some functions f, the Fourier sums 5, give 
poor approximations to fin the sense of the metric 


sup |f(x)—sn(x)|- 


—MLaTN 


Indeed at particular points x the difference | f(x)—s,(x)| may tend 
to infinity with n. 

We shall show in this section that the sums s,, give good approxi- 
mations to f as measured by 


[7 ¥eo-s,coyrax 


Among all trigonometric polynomials ¢, of degree at most n, the 
Fourier sum s, minimizes the integral 


| ft) 


This property is shared by expansions in sets of functions other 
than trigonometric functions. It is easier to write out a proof under 
general conditions. 


Orthogonal and orthonormal sets of functions. 


Definition. The functions $, for n = 1,2,..., Riemann integrable 
over [a, b], form an orthogonal set if 


{ “Gul2)Gu(e)dx = 0 (m +m) 
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If, in addition, 


b 
[ Code = 1 = 1,209, 

the set (or system) ¢,, is orthonormal. 
Definition. The number b 
en = | flv b(adae 
is the n-th Fourier coefficient of f for the orthonormal set ¢,,, and 

X Cn Pn 

i 
is the Fourier series of f. We write 8, = DiC» Pye 

1 


The trigonometric functions 


l : 
Wan)’ Ja COS NX, Ja sin 1X 


form an orthonormal set in [—7, 7]. 
Theorems 9.81 and 9.82 will be stated for a general orthonormal 
set @,, in [a, 5]. 


Theorem 9.81. If t,, = d,,, then, for fixed n and arbitrary d,, the 
1 


integral 


b 
| f@)- 1.0} 
b 
attains its least value when d, = c, = | IO) GAx as for’ a2”... 2. 
Proof. All integrals are over [a, b] and all sums from 1 to n. 


-2]fn+|+2[s,—[ st 


= —2%0¢,d,+id2?+20c-=e2 
X(c,— d,)?. 


[r- 48 [P—s0 


The sum is never negative and is 0 only when ¢, = s,. | 


Theorem 9.82. (Bessel’s inequality.) 


[o @) 
LG < {- 
1 
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Proof. We showed in the proof of theorem 9.81 that, for all n, 


[o-sot = [P- Ee2 
The left-hand side is not negative. | 
Corollary 1. In the trigonometric case 
d+ D(a +b) <2 [" Pe 
Corollary 2. Asn —> ©, c, > 9. 


The trigonometric case of Corollary 2 (a, —>0,5,—0) was 
proved differently in theorem 9.31. 

For some (not all) orthonormal sets the inequality in theorem 9.82 
can be strengthened to equality. This is true for the set of trigono- 
metric functions and we proceed to prove it. The most direct argu- 
ment depends on Fejér’s theorem 9.72. 


Theorem 9.83. (Parseval’s identity.) If the a,, b, are Fourier coeffici- 
ents of f, then “ (7 
a+ S(ai+by =>" f 


Proof. We have as in theorem 9.82 (for trigonometric functions) 
o<[ p—alaag+ si@+s9} =|" (f-s0% 
From theorem 9.81, if t,, is a polynomial of degree n, and k > n, 
[7 fsa < [7 00) 
Theorem 9.83 will follow if, given ¢, we can find a t, such that 
[7 fata? < Ke 


where K is a constant which may depend on f. This we set out to do. 
(i) If f is continuous, by theorem 9.72, corollary 1, there is a o,, 


for which | f(x) -—o,(x)| < J(e/27) for all x. 


Hence we have i (f—c,)? < ¢. 
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(ii) Assume only that f is integrable. With the usual notation, 
there is a dissection of [—7, 7] for which 


Si a) ee 
i=1 


Define a continuous function ¢ such that 
P(X;) = f(x;) for i= 0, 1, eeeg N, 


and ¢ is linear in each [x,;_,, x,]. Then, all the integrals being over 
[= 7, 77), : 


\f-9| < 2(M,;—m,)%;—-x:-) < 6, 
so that, if M = sup |f(x)| = sup |d(x)|, 


[7-8 < [2M |f-9| < Me, 
Since ¢ is continuous, there is by (i) a trigonometric polynomial t for 
which 
| @-o < €. 
From the inequality (a+b)? < 2(a?+56?) we have 
[or-o < 2] (f-9+2[@-0" 
< (4M +2). | | 


Exercises 9(e) 
1. If (a,, b,) are the Fourier coefficients of fand (c,, d,) those of g, prove that 


: { fe = 4ayco+ S(Qnent+ Bad). 
—7 1 


2. By Parseval’s theorem or otherwise prove that 


2 = 1 78 
= 90° © Gn—i = 960° 
3. Prove that, if k + 0, 
x 1 _ 34+k*n*— 3k coth kn 
T n(n2+k?) 6k4 : 


9.9. Fourier integrals 


We now attempt the analysis into harmonic components of a 
function f defined for —0o < x < o, which possesses no periods. 
This analysis will involve an integral in place of a series. 
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The requisite integral was discovered by Fourier himself from the 
following heuristic argument. Suppose that f has period 27A (where 
A will later tend to infinity) and that its Fourier series is 


1 . nx =, (ex 
4d + 2 (a, cos 1 +b,, sin 7 ; 
1 TA d 
where i, = al. f(padt 
1 (7 nt 1 7 ce 
and a, = 5] fo cos ~ dt, D, = a) £0 sin 5 at. 


Write ~ = u, and : = du, so that 


du (7 ; 
a, = = { f(t) cos u,tdt (and b,, likewise). 
—mAa 
Let now A > oo and we conjecture that 


igi = 3 {a(u) cos ux + b(u) sin ux} du, 
where 


ee : { * f(@ cos utdt, bu) =* { fa sinuid 690 
or, finally, = 12 00 : 
flx) =* | “du | f(t) cos u(t—x)dt. (9.92) 


To inject rigour into Fourier’s analysis would be troublesome. 
Instead, we start afresh and investigate the formula (9.92) on lines 
parallel to those which guided us in §$9.3, 9.4. 

Throughout §9.9 we make the assumption that { | f(t) | dt exists. 


Then the integrals in (9.91) exist for all values of u and the inner 
integral in (9.92) exists for all u, x. Moreover (by theorem 6.35) the 
convergence of the infinite integrals in (9.91) is uniform with respect 
to u for all u. 

The reader will prove without difficulty (exercise 9(f), 1) that 
a(u), b(u) are continuous functions of u. 

We first extend theorem 9.31 (Riemann—Lebesgue). 


Theorem 9.91. If | : | f(x) | dx exists, then 


3 F(x) cos Axdx +0 and 2 f(x) sin Ax dx > 0 


as A> oo. 
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Proof (for the cosine integral). Given e, there is X such that 
Xs foe) 
[lft] fi <e 
=o B.¢ 
By theorem 9.31 there is Ay = Ao(X, €) = A,(e) such that 


is f(x) cos Axdx 


=e for A> Ay 


Hence | s f(x) cos Axdx| < 2e for A> Aj, | 


[9.9 


The next theorem is the counterpart of theorem 9.33 (Dirichlet’s 


integral). 


Theorem 9.92. 
le de Ss oe ee -{- aa 


Proof. The inner integral on the left converges uniformly in uw. 
Hence, from theorem 8.74 (the hypothesis (i) of which is satisfied 


from exercise 8(c), 2), the left-hand side is equal to 


ee) U 
fe f(t) at cos u(t — x) du, 


which is the right-hand side. | 


We now define A(t) = f(xt+0H+f(x—1)-2s 


and follow the steps leading from theorem 9.33 to 9.34 and 9.35, 


obtaining two theorems. 


Theorem 9.93. A necessary and sufficient condition that 


sf aul AD) ROSE de = 5 


sin — 


is that lim of t) it = 0: 


U—co 
Proof. 


[fo at = [eens fer-9} 


sin a 


=f “ne St dt+2s| sin Ut 4, 
0 t 0 t 


and this last integral is 47 from §8.7 (exe-1ple). | 
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Theorem 9.94. (Localization theorem.) The convergence or diver- 
gence as U -> © of the integral 


U oe 
| du | f(t) cos u(t — x) dt 
0 —© 


depends only on the values of f in an.arbitrarily small neighbourhood 
(x—0,x+0) of x. 


Proof. Following the proof of theorem 9.35, we decompose 


[vets se-9 5 


) oa) 

into | +| . The function | f(x +1)+f(x—1)|/t is integrable in (6, 20), 
0 6 

because | /| is integrable in (— 00, 00). By theorem 9.91, 


sin Ut 


E {f(x+0) co a 


tends to 0 as U tends to o. 
Therefore, as U > ©, 


(ws fo) 
| du | f(t) cos u(t — x) dt 
0 —o2 


and [. du [- ; f(t) cos u(t — x) dt 


both converge to the same limit or both diverge. The latter integral 
depends only on the values, of fin (x- 6, x+9). | 


We are now in a position to write down the criterion for integrals 
analogous to that of theorem,9.36 for series. 


Theorem 9.95. A necessary and sufficient condition that 
Al du | f(t) cosu(t—x)dt =s 
0 —2o ; 


is that, for some 6 > QO, 


~ af P(t) 


sin —d 


= 0. 


Proof. 


[, g(r at = -|- (fet + fee— 9H dt—2s pas 


sin sin Ut 
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The first integral tends to 0 as U > o0 by the proof of theorem 9.94. 


. pos 
The integral in the second term is | dv, which tends to 0. | 
Us 


Theorem 9.95 shows that the Fourier integral of f converges at a 
point x if f (restricted to an interval containing x) satisfies any known 
set of sufficient conditions for the convergence at x of its Fourier 
series, for instance those in theorem 9.41 or 9.42. 


Fourier transforms. Suppose that fis defined on [0, 00). Let fZ be 
the even function on (— ©, 0) coinciding with f on [0, 00). From 
equations (9.91), 


a(u) = : | s FAt) cos utat = | F(t) cos utdt, 


b(u) = : | a f(t) sin utdt = 0. 


If now f, as well as being absolutely integrable over [0, 00), satisfies 
at x sufficient conditions for the convergence of Fourier series (such 
as Dini’s or Jordan’s) we have 


I(x) = | ; a(u) cos uxdu. 


The formulae for a(u) and f(x) become quite symmetrical if we 
adjust the numerical multiplier and write 


a(u) = 2)" S(t) cos utdt, 


I(x) = (?)’ | z a(u) cos uxdu. 


77 0 


Two functions such as f(x), a(u) linked by this reciprocal relation 
are called cosine transforms of each other. 
Similarly an odd function gives rise to a pair of Fourier sine 


transforms + peo 
i= (=) | “ b(u) sin uscd, 


where b(u) = (2)' { : F(#) sin ut dt. 


It is in the shape of transforms that Fourier integrals are most 
readily used. 
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It will be observed that, though the formulae are symmetrical, the 
properties of the functions f(x) and a(u) (or b(u)) differ. To attain 
further symmetry it is necessary to use the Lebesgue integral. 

Transforms according to other definitions (notably Laplace trans- 
forms) are also important in analysis and in mathematical physics. 


Exercises of ) 
1. Prove that, in (9.91), the functions a(u) and b(u) are continuous. 


2. Find the function b(u) satisfying the integral equation 
F(x) =| b(u) sin ux du ix > 0), 
0 


given that, for x > 0, f(x) is 
G) 1 for O<x<z7, 
QO. for. x > 7, 
(ii) e-*, 
(iii) xe-**, where k > 0. 
3. Prove that, if b > 0, 


© COS ax 7 : 
[, ape = a: ia >: 


What is the value if a < 0? 


Further exercises on Fourier and Laplace transforms are in 14(a), 16-19, 
when results obtained by complex integration are available. 

The reader may verify that the complex-exponential version of Fourier 
transforms (in the sense of the note at the end of §9.1) is, with the same hypotheses 
on f as for cosine or sine transforms, 


1\? fo 
—— (;-) \ Sem de 
£ 4 oe) 
GSS (;-) | © ewer du, 


NOTES ON CHAPTER 9 


§9.1. (History.) The physical problem which impelled the study of trigono- 
metric series was that of the transverse vibration of a string, with ends fixed at 
the points (0, 0), (/, 0). Mathematicians of the 18th century were in possession 
of the differential equation 


oy vy 


ar? = C ax? (c constant) 


satisfied by the configuration y = y(x, t) of the string. Here y(0, ¢) = y(/, t) = 0 
for all ¢. 
In 1747 d’Alembert, by the change of variables 


E=xt+ct, y= x-ct, 
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woe 
ef Oy 
which is satisfied by y= o)+v(y) 
= &(x+ct)+YW(x-ct), 


transformed the equation to 0, 


where the arbitrary functions ¢, ¥ are to be determined by the initial shape and 
motion of the string. Suppose, following d’Alembert, that, when ¢ = 0, the 
string is at rest in the configuration y = f(x) for 0 < x < 1. Then 


P(x) + W(x) = f). 
Since the velocity at every point is 0 when ¢ = 0, 
ch'(x)—cy'(x) = 0, 
and this can be integrated to give 
P(x) = W(x). 


In this way d’Alembert was led to the solution for 0 < x < /and all positive 1, 
y = 4f(xt+at)+4f(x—-at). 


At about the same time, Euler made a similar analysis of the problem. The 
two men differed, however, in their concept of a function. Was an analytical 
expression required (as d’Alembert held) or was a specified graph an adequate 
definition (Euler)? Essentially, d’Alembert’s notion of a function was too narrow 
(needing in his hands derivatives of every order) and Euler’s was too vague. 

In 1753, Daniel Bernoulli used the method of separating variables to find 
particular solutions y = u(x)v(t) of the wave equation. The equation is satisfied 
by u(x) = sin px or cos px, and v(t) = sin pct or cos pct, where p is a constant. The 
condition y(0,t) = 0 (all t) excludes cospx, and the condition y(/, t) = 0 
restricts p to the values n7/l. 

Since the equation is linear, particular solutions may be added to give 


pe > Sin = (2, cos cine Sa sin | : 
Assume, with d’Alembert and Euler, that when ¢ = 0, the string is at rest in 
the form y = f(x). We have b,, = 0 (all n) and 
1.0 = Fae oe 
n=1 l l 
Bernoulli asserted this value of y(x, t) to be the general solution and so it had to 
include the d’Alembert—Euler solution. This implied (for ¢ = 0) that an arbitrary 
function f(x) in [0, 7] could be expressed as 


Lo @) 
Dd a, sin nx, 
n=1 
but this was unacceptable to both d’Alembert and Euler. 
The 18th century mathematicians summed a number of infinite trigonometric 
series, and their results, though lacking rigorous proof, were often right. As 
instances, Euler stated that 


sin x—4 sin 2x+4 sin 3x-... 
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had sum 4x, but apparently believed this to hold for all x instead of for 
—7m < x <7 only. Bernoulli gave the sum 4(7— x) for 


sin x +4 sin 2x+4 sin 3x+..., 


with the correct range of validity 0 < x < 2z7. 

The next advance came in 1807 when Fourier proved in many special cases 
that a function f, which might have discontinuities, could be expanded in a 
convergent trigonometric series. Proofs of the possibility of expanding a general 
function were attempted by Fourier himself, Poisson, and Cauchy. Fourier’s 
contributions were recognized by the attachment of his name to the trigono- 
metric series associated with a given function. The first real proof that a wide 
class of functions could be expressed as trigonometric series is due to Dirichlet 
(1829). 

Since the days of the early 19th century analysts, the mutual interaction of 
trigonometric series with the main branches of analysis has been fruitful. As 
illustrations we cite (with names and rough dates) the topics of integration and 
differentiation (Riemann 1850), sets of points (Cantor 1880), functions of bounded 
variation (Jordan 1890), summability of divergent series (Fejér 1904), integration 
and differentiation (a second wave, Lebesgue 1900). If theorems about trigono- 
metric series are to be free from restrictions not inherent in the problems, it is 
necessary to interpret integration in the sense of Lebesgue. The Lebesgue integral 
is not included in this book, and in this chapter we do the best we can with 
integrals in the Riemann sense. 


§9.4. Gibbs pointed out the phenomenon in 1899. It had however been observed 
by Wilbraham in 1848. There is a good chapter, with illuminating diagrams, 
in Carslaw, Fourier Series and Integrals. 


§9.6. The student should read the historical account given in Hardy, Divergent 
Series. This contains a section on Fourier and his theorem (p. 29). 


10 
COMPLEX FUNCTION THEORY 


10.1. Complex numbers and functions 


The rest of the book will be devoted to a closer study of complex 
functions, namely functions whose domain of definition and range 
are both in Z. 

The function value f(z) will usually be called w, and we shall keep 
tp Wig Rotation z= x+iy = r(cos 0+isin 9), 
w= ut+iv = p(cos d+isin ¢). 


Numbers such as a and b in w = az+5 will in general be complex 
unless common usage proclaims their reality (as for u, v, p, @ above). 

The reader will be familiar with the definitions and properties of 
the conjugate, Z = x—iy; the modulus, |z| = r = ./(x*?+y"); and the 
phase, the angle 6 for which cos @ = x/r and sin@ = y/r, where 
r + 0. The phase has infinitely many values differing by multiples 
of 27. We shall write Phz (with a capital P) for any of these values. 
It is often convenient to have a principal value of Phz, and this is 
defined to be the value such that —7 < Phz < 7. The principal — 
value will be written phz with a small p. 

We write an equation such as 


Ph(z,2.) = Phz,+Phz, 
as being true modulo 27, that is to say, the two sides may differ by an 
integral multiple of 27. The reader may verify that 
ph(z,z,.) = phz,+phz,+2kz, 
where k may be 0 or 1 or —1, depending on z, and Z,. 
Writing w = f(z) as ne fb), 


we see that this is equivalent to a pair of real functions of two real 


variables u=u(x,y), v = (x,y), 


defined for a given set of pairs (x, y). This notion of a function of z 
is too wide to be the most useful. For instance, functions so defined 
cannot in general be differentiated if we define, as seems natural, 


f(z) = lim (6w/dz). 
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Illustrations. (1) w = 4(z+Z), that is w = x,is a function of z in the sense that z 
determines w uniquely. But, in notation which explains itself, if dz has phase 6, 


dl _ |x 
Oz Oz 


= |cos 6, 


and so the modulus of dw/dz takes all values between 0 and 1 according to the 
phase of 6z, and dw/dz does not tend to a limit as 6z > 0. 

(2) If, however, w = z?, then, as 6z > 0, dw/dz tends to the limit 2z. The 
reasoning is just like that which proves that, if y = x’, then dy/dx tends to 2x. 


Usually the domain of definition of a function of z is a region 
(defined in §3.3 to be a connected open set). Continuity of f for a 
given z then means that lim f(z +h) = f(z) as A tends to O through 
complex values. 


Definition of derivative. If, for a given z, 


Lec -fo 


tends to a limit as h tends to 0, then f is said to be differentiable for that 
z, and the limit, written f'(z), is the derivative. 


Theorem 10.1. Necessary and sufficient conditions for f to be differ- 
entiable at a given point z are that u and v are differentiable (in the 
sense of 7.11) and their partial derivatives satisfy the Cauchy—Riemann 
equations 

Up = Vy Uy = —Uz 
at the corresponding (x, y). 


Proof. We prove necessity. Let f’(z) = p+ig. Write h = k+il. By 
definition of f’(z), 


fE+h)-f@ = (p+ig(k+il) +o +P}. 
Taking real and imaginary parts, we have 

uxt+k, y+lD—u(x, y) = pk—qlt of/(k? +2}, 

u(xt+k, yt+l—v(x, y) = qk+pl+ol/(k?+P}. 


Hence u(x, y) and v(x, y) are differentiable in the sense of (7.11) or 
C1 (156) and 


U, =D =Vy Uy = —G = —U;. 


To prove sufficiency, we reverse the steps of the argument. | 


Notes. (1) Comparing the definition of f’(z) with that of the linear 
derivative A in (7.12) we see that the linear function A(h) is f'(z)h. 
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The connection between the mappings from Z to Z and from R? to R? 
is elaborated in a note at the end of the chapter. 

(2) In complex function theory the notation u, for the partial 
derivative 0u/0x is more expressive than the general notation D,f 
which is best in chapter 7. 

(3) Taking dz to be dx and idy in turn, we have 


ie Se ; 
f'(2) ps Ox = Uy t Wy 
and T= -iZ = V,— ily. 


(4) If the first partial derivatives of u and v are differentiable 
(which by theorem 7.84, is a sufficient condition that u,, = uy,), then 
the Cauchy—Riemann equations show that 

Ugg = Upy = —Uyys 
that is to say, u (and similarly v) satisfies Laplace’s equation in two 
variables ay au F 
ax2' aya 


Functions which satisfy this equation are called harmonic functions. 
They play a large part in both pure mathematics and physics. 


Manipulative properties of complex derivatives are in general 
straightforward adaptations of those of real derivatives, for instance 


£ fig(2)} = f'te}'©). 


The following examples illustrate methods of solving some 
problems suggested by this section. 


Example 1. If, for all z in a region D, f'(z) = 0, then f is constant 
in D. 
Proof. By note (3) above, at every point of D, 
Uy = Uy = UV, = v, = 0. 
If the segment (x, y,), (X, Ye) is in D, the mean value theorem shows 


that 
U(X, Yo) — U(X, V1) = Va- Vi) uy (x, 1) = 9, 


where y; < 7 < yo. 
If a and b are any two points of D, they can, by theorem 3.32, be 
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joined by a polygon whose sides are horizontal or vertical. The above 
argument applied to u and v along these sides gives f(b) = f(a), as 
required. | 


This splitting into real and imaginary parts is necessary because 
(§7.3) Rolle’s theorem and the mean value theorem do not extend 
directly to a function from R? to R? (or from Z to Z). It is possible, 
however, to prove an inequality form of the mean value theorem on 
the model of theorem 7.31 (see exercise 10(a), 10). 


Example 2. Find f(z), differentiable for all z except z = 0, having 


real part x Ey 


x? + y? 


(which satisfies Laplace’s equation except for x = y = 0). 
Solution. The neatest way is to exploit conjugates, 
2u(x, y) = (ut+iv)+(u—iv) = f(z) +f(2). 
Writing 2x = z+Z and 2iy = z—Z, we have 


pies z+Z—-i(z—2) * a | ied 
2Z Zz Z 
Hence f(z) = +t is a solution; and iA, where A is an arbitrary real 


constant, may be added to it. 


10.2. Regular functions 


In §10.1 we indicated that the existence of the derivative f’ is 
necessary if complex functions f are to have interesting properties. 
For instance, 


w= aah er w= -expz 
~ ez+d Bors 
are acceptable, the former being differentiable except for z = —(d/c) 


and the latter for all z. We shall investigate both of these functions 
in §§10.3-10.6. 

Suppose now that w? = z. Corresponding to a given z there are 
two values of w (if z + 0). We were faced with this ambiguity in 
discussing the relation y? = x between real numbers. We could then 
agree that y? = x comprises two separate functions, namely (for 


x 2 0) y= +x and y= —,/x, 
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and each of these could be investigated in its own right. From w? = z 
in complex variables the two values of w cannot be so readily dis- 
entangled, and the reader should go into the following details in 
order to appreciate this fact. 

Write 


z=r(cos0+isin@) and w = p(cos ¢+isin 9), 


where r > 0 and p > 0. Equating moduli and phases in w? = z, we 
nave p?-=r and 2¢ = 6+2nz, 
where n may be any integer. The values 

@¢=40 and ¢=30+4+7 


give two distinct values of w. Taking as an illustration the special 
case r = p = 1, suppose that 0 < @ < 27. Write 


= cos 40+isin40, w, = cos (40+7)+isin 40+7). 


If z moves once round the circle |z| = 1, one of the two correspond- 
ing values of w moves from w, round to w, and the other from w, 
to Wj. 

In tracing the values of w we have for simplicity moved z round a 
circle with centre z = 0. The phenomenon of interchange of values 
occurs if z follows any path which winds round the origin z = 0. 

A point, like z = 0 here, about which values of w interchange is 
called a branch point. The different values of w (there being two in 
this illustration) generate branches. 

In order to define a function of z satisfying the rélnes Ww = 2, we 
must restrict the domain of definition to one which contains no 
path winding round the origin. Such a domain is the z-plane with 
the negative real axis (y = 0,x < 0) deleted. We shall call this 
region the z-plane cut along the negative real axis. Any ray from the 
origin would serve equally well for the cut. If then w is defined to be 
(say) —1 when z = 1, the relation w? = z determines a continuous 
function of z in the cut plane. We shall meet another instance of 
multiplicity of values in §10.6 and we shall take the problem up 
more systematically in §13.3. 

When writing in the sequel w = f(z) we shall always understand 
that, for any z in the region (or set of points) contemplated, w is 
defined uniquely. 


Definition. f is regular in a region D if f’(z) exists at every point z 
of D. | 
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Definition. f is regular at the point a if there is an open disc containing 
a in which f is regular. | 

The reader will remark that the order of these definitions is 
opposite to that customary for real functions. In defining, say, 
continuity of a function f: R! > R', we first define continuity at a 
point x = a. Afterwards we define continuity in an interval to mean 
continuity at each point of it. The reason for the emphasis on a 
region is that (as we shall prove in the next chapter) differentiability 
of f(z) in a region entails the existence of higher derivatives as well. 
No such further implication holds for a real function (x). 

On the other hand functions regular in a region or at a point have 
most properties that our knowledge of differentiable real functions 
would lead us to expect. Regularity is preserved under algebraic 
operations; for example the quotient f/g of two regular functions is 
regular except at points where g vanishes. 


Illustration. If n is a positive integer, z” is regular for all z, z~” is regular for 
all z except z = 0. 


An extensive class of regular functions is formed by those defined 
as power series or known to be expressible as power series. We prove 
this. 


~ 

Theorem 10.2. If the power series >, a,,z" converges for |z| < R, then 
; 

its sum f(z) is regular for |z| < R. Moreover its derivative f'(z) is 

ce 

2 ma,2" for iz) =< Re 

: . 


Proof. By theorem 4.62 the series Xna,,z"—1 converges for |z| < R. 
So also does the series, soon to be used, &n(n—1)a,z"~*. 
Fix z, and take € = z+h with 


lzi+|h| < R, < R. 
We wish to prove that, as h > 0, 


s) A. are. ~ y na,z"-1 > 0. (10.21) 
0 1 


The coefficient of a, is 0 for n = 0 or 1, and for n > 2 is 
(Crt Om 4+ tz") — nz 


4 
= ss pers as z”-P) 
pet 


(Ee Gate 
p=1 
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Since |z| < R, and |¢| < R,, the modulus of the last line is at most 
n—1 
|A| ~ (n—p) Ri? = 4n(n—1) RY *IAl. 
p= 


Since s n(n—1)|a,| R2-2 converges, the left-hand side of (10.21) is 
2 
O(|h|). | 


Notes. (1) What we have proved is ‘complex differentiability 
term by term’ for power series. The proof of theorem 5.46 held only 
for differentiation with respect to a real variable. 

(2) Since all the series obtained by any number of term by term 
differentiations of a power series have the same radius of con- 
vergence, theorem 10.2 shows that a function represented by a power 
series has within its circle of convergence derivatives of every order, 
and they are all regular functions. 

It will appear in §11.9 that the property of possessing derivatives 
of every order belongs to all regular functions, whether defined by 
power series or in some other way. 

Theorem 10.2 shows, for example, that exp z, sin z and cos z are 
regular for all values of z. 

Terminology. Holomorphic (and also monogenic) is a synonym for 
regular. Some writers use the word analytic in the sense in which we 
have used regular; we prefer to reserve it to carry a wider meaning 
which will be explained in §13.4. 


Exercises 10(a) 


1. Establish necessary and sufficient conditions that 
(i) the points z,, Z., Z; are collinear; 
(ii) the triangles formed by 2z,, 22, Z3 and 24, Zs, Z, are similar. 


2. In theorem 10.2, prove that 
f@O) = nap. 


3. Prove that, if 
Oo< 4, = as = Oe 


no root of the equation 
AnZ”™ + An_1Z" 1+... +a) = O 


lies in the closed unit disc |z| < 1. 


4. State the connection between phz and arc tan (y/x), the latter having its 
principal value 0 where tan 0 = y/x and —47 < 6 < 4n. 


5. For what values of z are the following functions differentiable? 


Z, Ae, a 
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6. If, for all z in a region, fis differentiable and |f| is constant, prove that fis 
constant. 


7. Suppose that we define z; < z, to mean either 
Gi) |z| < |z.| 
or (ii) |z,| = |z.| and phz, < phz,. 
Would this establish an order among the complex numbers? 


8. Find conditions that the equation 
az+bz+c=0 


shall give (i) one value of z, (ii) no value, (iii) infinitely many values. 


9. Prove Lagrange’s identity 


n 2 nN n re =a 
bad a,b, — > |a,|? 3 |b,|?—- e3 |a,b,—a,;b,|?. 
r=1 r=1 r 1l<r<s<n 


=1 


Deduce the Cauchy-Schwarz inequality (2.13) for complex numbers. 


10. (Mean value inequality.) If |f’| < M on the segment [z,, Z.], then 
| f(z.) —f(z1)| S M|z2—2;| . 


10.3. Conformal mapping 


Some parts of complex function theory gain in clarity if geometrical 
language and diagrams are used. There will be nothing unsure in the 
foundations of what we do; everything can be expressed in analytical 
terms. But a geometrical picture can be more vivid and revealing than 
collections of equations. 

The geometrical meaning of f(z). Suppose that f is regular in a 
region D. Then w = f(z) maps D onto a set of points in the w-plane. 

Let a be a point of D at which f‘(a) + 0. Let f(a) = 5. Then, as 
= w—b 


> f(a) 
If f(a) = k(cosa+isin a), where k > 0, then 
wa] 
Iz—a| 
and Ph(w—b)—Ph(z—a) > « (modulo 27). 


Suppose that z approaches a along a curve C represented by a 
differentiable function z = z(t) for ty < t < t,, where z(to) = a and 
z'(to) + 0. In the w-plane there is a corresponding curve I’, 


w = f{z(t)} and w(to) = f'{z(to)}Z'(to). 
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Hence w’(t,) + 0 and I’ has a tangent at b. If the tangents to C and 
I make angles /, y with the respective real axes, then 


y—f = «(mod 27). 


It follows that, if two curves C, and C, meeting at z = a are mapped 
into [, and I, meeting at w = b, then the angle between the tangents 
at b to l', and T, is equal to the angle between the tangents at a to 
C, and C,, the angles being measured in the same sense. 

A mapping with this conservation of angles is called conformal. 


10.4. The bilinear mapping. The extended plane 


A good initiation into the methods of complex function theory is a 
close study of some simple functions (or mappings or transforma- 
tions). 

As a first example, let us try to form a mental picture of the © 
transformation from z to w defined by 


w = (2+i)z+(3+2i). 


Here w is a function of z of the same form as y = 2x+3 in real 
numbers. This relation (Jinear in the elementary sense) is seen as a 
straight line in the (x, y) plane. A corresponding picture relating z 
and w would need four dimensions, two for each complex variable. 
The best that we can do is to envisage a z-plane and a w-plane and 
to hold together in our attention configurations in the two planes 
which are connected by the given relation. To illustrate this procedure 
we discuss relations of the simplest possible types. 

(1) w = z+b. Any figure in the z-plane is transformed into its 
w-counterpart by translation through the vector b. 

(2) w = az. Let a have modulus k and phase a. Then 

w= kz, where z, = z(cosa+isin a), 

The former of these equations represents a change of scale by the 
factor k, and the latter a rotation through the angle « about the 
origin. 

(3) w = az+b. The geometrical operations involved in this com- 
bination of (1) and (2) are translation, change of scale, rotation. 

Note that (if a + 0) the mapping is everywhere conformal. 

(4) w = 1/z. This defines w for all z except z = 0. If 


z= r(cos 6+isin 0), 
then w = p(cos 6+isin ¢) where p = I/r and ¢ = —8. 
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We recall that two points A, B are inverse in a circle with centre O 
and radius c if O, A, B are in line and OA.OB = c’. 

Then w = 1/z corresponds geometrically to (i) inversion in the 
circle having centre at the origin and radius 1, and then (ii) reflection 
in the real axis. These two ss pesca can be performed in either 
order. 

The point at infinity. In example (4) there are advantages in 
adjoining a ‘point at infinity’ to the complex plane. The symbol 
co is defined to have the properties 


a+o = 0o+a= 0 
for any (finite) complex number a, and 
(b.o = 0.b = 


for any b + 0, including b = 
We agree further that 


if «+= men 4/0 cO 
and if b+0o then b/o =0. 


It is not profitable to try to define 00+<0 because algebraic 
manipulations involving this operation would lead to inconsistencies. 


Observe that in complex function theory we are content with one point at 
infinity (or one infinite value). In different branches of mathematics other 
conventions about infinite elements may be appropriate. The reader will see 
from §4.2 that if an infinite element were adjoined to the set of real numbers R? 
it would have to carry a + or — sign. The view is there taken that rules for 
algebraic manipulation would be of more nuisance than value. Such rules would 
have included, for instance, 0o+00 = 00, but oo—oo could not be interpreted 
without violation of algebraic laws. 

In plane projective geometry every straight line has a point at infinity ae 
these points form the /ine at infinity. 

A pictorial illustration of the complex number © is contained in the notes 
at the end of the chapter. 


Denoting the complex plane by Z, we shall speak of Z with the 
point co added to it as the extended (or closed) plane and call it Z. 

Returning to w = 1/z which precipitated this discussion of ©, 
we see that in the extended z and w planes the mapping is bijective, 
the point 0 in either plane corresponding to ©o in the other. 

The bilinear mapping. Let a, b, c, d be (complex) constants such 


that ad—bc + 0. Then see 


~ ez +a 


or czw—az+dw—b = 0, 
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is the most general algebraic relation which is linear both in zand in w. 
(If ad = bc, either w or z is constant.) This mapping is called bilinear 
(or homographic) and is sometimes named after MG6bius (1790-1868) 
who first made an extensive study of it. 

Suppose that c + 0, for, if c = 0, we are back at (3). Then 


ous bc —ad 
~ @ e(cz+d)’ 
a bc—ad 1 
16. w=-—+ yy ewes = cee 
C C 25 
and so the transformation is composed of simpler ones of the forms 
(3), (4). 
Solving for z in terms of w, we have 
- —dwt+b 
 cw-a- 


If we admit a point at infinity in each of the z and w planes, the 
transformation is one-one without exception since w = © corre- 
sponds to z = —(d/c)andz = otow = c/a. 


10.5. Properties of bilinear mappings 


This section is included, although its results are peripheral to the 
main lines of theory and applications, to illustrate methods of dealing 
with complex variables. When a proof of a result presents no striking 
feature it is sometimes left to the reader. 

For shortness we write w = Tz for 


oe es 
cz+ aq’ 


‘so that T is determined by the matrix 


Ca) 
clad 
with determinant different from 0. The mapping w = Tz is the same 


if each element of the matrix is multiplied by a constant k (not 0). 
The range of both z and w is the extended plane Z. 


(1) Bilinear transformations form a group. Two transformations T, 
and 7, are compounded by multiplying their matrices. The associative 


law (7,T2)T; = T,(T2T3) 
can be verified directly. 
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The unit is ( i) 


0-1 
and the inverse 7! of T is 


Couod 


(2) A bilinear mapping is determined by: three conditions. In 
particular, there is a unique w = 7z which maps 2, Z,, Z; onto ~ 
W1, Wo, Ws respectively. For proof, we could substitute the pairs of 
values in w = Tz and solve for a: b: c: d. It is much easier to intro- 
duce an intermediate variable £ which takes the values 0, 1, co when 
zZ iS Z,, Zo, Z3 respectively. We can then write at sight 


to give the right values of ¢ for z, and zs. The condition ¢ = 1 for 
z = Z, fixes k, and we have 
ea (z — 2,) (22 — 2s) 


(Z— Zs) (Z_— 21) 
From the similar connection between ¢ and w we find 


(w—W,)(We—Ws) _ = 23) (22-2; Zs) 


(w—W,)(W2— Ws) (2a 
Corollary. If, in the language of projective geometry, we define 
the cross-ratio (z, Z, Z3 Z4) of the four points to be 


(2, — Zp) (Zs — 24) 


(Z; —Z4)(Z3— 22)’ 
then cross-ratios are invariant under bilinear transformations. 

(3) The cross-ratio (z12Z2Z3Z4) is real if and only if the four points 
lie on a circle or straight line. 

Proof. Ph(z,2.2Z3Z4) = 3 ae ye eee 


24 23 ia: 


By angle properties of circles (or lines) this is 0 or 7 according to 
the relative positions of the points. | 


In the rest of this section we shall use the word circle to mean 
either circle in the conventional sense or straight line. A straight 
line in Z is a circle through z = 00. 
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(4) A bilinear transformation maps circles into circles. 


Proof. If we assume (3), the result follows from the corollary to (2). 
Alternatively we can check that each of the transformations (3), (4) 
in §10.4 turns circles into circles. 

(5) The conformal property. The mapping w = Tz is conformal 
except at w = © or z = ©. (We shall not try to interpret con- 
formality at infinite values.) The preservation of angles of inter- 
section under the mapping w = 77z is a useful tool. 

(6) The transformation w = k(z—a)/(z—b). Here circles C in Z 
through a, b, correspond to straight lines through w = 0. Circles 
|w| = e with centre w = 0 correspond to circles D 
2—g| pp 
eS baer: 

(circles of Apollonius for a,b). Any C circle cuts any D circle 
orthogonally. 

We prove that a bilinear transformation maps a circle and two 
points inverse in it into a circle and two inverse points. This is clear 
for the elementary operations of translation, change of scale, rotation 
and reflection which are included in §10.4, (1)-(4). To see that it is 
true for the remaining operation of inversion, suppose that = is a 
circle and a, b are two points inverse in it. All circles I through a, b 
cut & orthogonally. By the conformal property of preservation of 
angles, the inverses I” of the circles I’ cut the inverse X’ orthogonally. 
The two points common to all the I’ are inverse in 2’. 

(7) Fixed points. A value of z for which z = Tz is called a fixed 
point of the mapping. The quadratic equation gives two fixed points 
(which may coincide). 

If the fixed points are distinct, say p and g, then w = Jz must be 
the same as W-Pp _ ,2-P 


where k is some constant. 
Suppose now that the fixed points coincide. In the special case 
when they are both 0 for the mapping 


= @Z+d 
~ z+’ 
so that w = z leads to z? = QO, it is clear that d = a,b = 0, c + 0, 
and hence ae uu) 
er 


where / is an arbitrary constant. 
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If the fixed points are w = z = p, this becomes 
] ] 


ee et 


w-p Zz-—p 


(8) The general bilinear mapping of the disc FB < 1 onto the disc 


|w| < lis 
z—a 


1a 


w= ef 


where |a| < 1 and a is an arbitrary real number. 


eee The frontier |z| = 1 will be mapped into the frontier 
|w| = 

"a (6), the points 0, co regarded as inverse in the circle |w| = 
will be mapped into points inverse in |z| = 1. These are of — 
form a, 1/a@, where |a| < 1. So the mapping is 


ia 
1—az 

for suitable k. Any point z = e on |z| = 1 is to correspond to a 
point on |w| = 1. Therefore 


le#—a| _ , le—al _ 
= |k | |1- \1—de*4| |k | |e-# — le-#—@| |k|. 

Hence the circles |z| = 1 and |w| = 1 correspond. By theorem 3.33 

a continuous function maps a connected set into a connected set and 


so the disc |z| < 1 is mapped into either |w| < 1 or |w| > 1. Since 
z = a goes into w = 0, we have |w| < 1. | 


Exercises 10(5) 


1. Prove that the general bilinear mapping of the upper half z-plane (y > 0) 
onto the disc |w| < 1 is 


where im a > 0 and ¢@ is an arbitrary real number. 


2. Find the transforms under the mapping 
z—2 
2z+1 


w= 


of (i) the circles |z| = 1, |z| = 2, (ii) the region 1 < |z| < 2, rez > 0. 


3. Map the unit disc |z| < 1 onto the region |w—1| < 1, taking z = 0 into 
w=4andz= 1 intow = 0. 
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4. Prove that every bilinear mapping of the upper half plane onto itself is 
expressible as ak 


ae: 
with real coefficients and ad—bc > 0. 


5. Investigate the convergence of the sequence z,, where z, = TZn_, (n > 1), 
Z, being given and T bilinear. 


6. Prove that w = (z*+3z)/(3z?+1) transforms circles through z = +1 into 
circles through w = +1. What in Z corresponds to |w| < 1? 


7. In the mapping z2—6z+10 
w= ———__, 
2z—6 
what in Z corresponds to the region |w| > 1? 
8. Prove that, if m > 1, the transformation 
1 


co 2 cos (1/n)—z 


is such that 7"(z) = z. Taking n = 3, draw diagrams to show the successive 
transforms T, T? of the disc |z| < 1. 


9. Transform the quadrantal region x > 0, y > 0, |z| < 1 into the disc |w| < 1 
(via a semicircular region, quarter plane, half plane). 


10. Frame a definition of z, > 0 as n>. 


11. Prove that, if p(z) is a polynomial, |p(z)! > as |z| > 0, 


10.6. Exponential and logarithm 


We assume a knowledge of the exponential, logarithmic and 
trigonometrical functions of a real variable such as can be gleaned 
from Cl, chapter 6. Familiarity with the power series for exp x, 
sin x, cos x (either taken as definitions of the functions or obtained 
from other definitions) is taken for granted. 

The power series 
z2 
2! 
converges for all complex z. Properties deduced from it such as 

exp (z+w) = expzexpw 


gu 
expz = 14+2z+ toto te. 


are valid for complex numbers as well as for real numbers (C1, p. 105). 
The function expz is regular for all z and (theorem 10.2) its 
derivative exp z is never 0. Hence the mapping w = exp z is con- 
formal for all z. To investigate this mapping write 
p(cos @+i.sin ¢) = w = expz = exp x exp iy 
| e*(cos y+isin y). 
Therefore p = e” and ¢ = y+2nm, where n is an arbitrary integer. 
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The known period 27 of cosy and sin y shows that Pup < z has 
period 27i (C1, 115). 

Straight lines x = constant correspond to circles |w| = constant 
and lines y = constant correspond to rays ¢ = constant. (Note the 
preservation of orthogonal intersection.) Any strip of height 27 in 
the z-plane is mapped onto the whole w-plane. 

Consider now the logarithmic function. Can we define it as the 
inverse of the exponential? 


Write w = Logz 
if and only if exp w = Z. 


Then r(cos 9+isin 0) = z = expw = e“(cosv+isinv). Since no 
value of w makes exp w zero, Log z must remain undefined forz = 0. 


so that Log z has infinitely many values differing by multiples of 27. 

As in §10.2, we can define a region D by deleting from Z an 
assigned ray from the origin, say the ray phz = «. If we then specify 
the value of Log z at one point of D, this determines a branch of 
Log z continuous at all points of D. Phz will satisfy, for a fixed 
integer k, the inequalities 


a+2kn < Phz < «+2(k+1)z7. 


The following definition is now natural. 
The principal value of Log z, written log z, is the value of Log z 
for which Phz takes its principal value phz. In fact, 


log z = log |z| +iphz 
= u+lv, Say, 


where —m™<vu<7 
If z is real and positive, the principal value log z is the real value. 


Properties. (1) If z + 0, then 


for any value of Log z. 
Proof. Suppose that w = Log z is defined in a region D with 
a+2kn < Phz < a+2(k+1)7. 
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Let a be a point of D and b = Loga. Then, at a, log |z| and Phz 
are continuous and w—b > 0 as z—a>0. 


Log z—Loga _ Ss Se | 
z—a ~ expw—expb expb a’ 
(0) lon 42) = 2B a ee: 


Proof. The principal value log(1+z) is uniquely defined in 
|z| < 1 because, for all such z, re(1+z) > 0. 
The left and right-hand sides have the respective derivatives 


1(i+z) and 1-—z+2?-... 


which are equal for |z| < 1. 

From §10.1, example 1, the difference between log (1 +z) and the 
series z—4z?+ ... is constant. This difference is 0 for z = 0 and 
hence for all z in |z| < 1. | 


(3) Log z,+Log z. = Log Z,2,. 
log z, + log z, = log 2,Z).+2kz7i, 


where k may be 0, 1 or —1. 


Complex powers. 
Definition. The power z’ (where z + 0) means exp (€ Log 2). 

In general zé has infinitely many values. If ¢ is an integer, z‘ is 
single valued. In particular, z~t = 1/z. 
Definition. The principal value of z® is exp (¢ log 2). 

By convention eé is always interpreted as the principal value, that 
is to say, exp ¢. 

Properties. (1) zz’: = z+ provided that the same value of 
Log z is taken in each of the powers. 


(2) _ zé = €z‘-1, provided that the same value of Log z is taken 


in each of the powers. In particular, if the powers have their principal 
values, the equation holds for —m < phz < 7. 


Exercises 10(c) 
1. For what interpretations of the many-valued expressions are the following 


true? 
I/zé = z-§, zbx zh = (2,29), (z5s)$2 = 28s 8s, 


2. Find all the values of i*. 
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3. Describe the mappings 
(+2) 
w=-—|{z+-], w= coshz, 
3 


4. Find a regular function u+ iv, given that 
v = e"(xcos x+y sin x). 


5. Prove that w = tan z maps a strip of the z-plane onto the w-plane with the 
points +i deleted. 


6. Examine for convergence and sum the series 


cos nO cos n0 cos” 0 


S(- yr SSE Spa SEO. 
1 n 1 n 


7. Prove that, if all zeros of the polynomial p(z) have positive real parts, so have 
all zeros of p’(z). 

Prove that, if all zeros of p(z) lie in a convex polygon P, the zeros of p’(z) 
lie in P. 
8. Discuss convergence and uniformity of convergence of 


ol 


p> 2 (principal values). 
1 


9. Prove that the function of two complex variables z, w defined (with principal 
values) by 


ie.0) 


f(, w) = 2 z*ne* 
n=] 
is continuous in each of the three sets of (z, w) 
(i) |z| <1, w arbitrary, 
Gi) jz| <1, ten = 9; 
(iii) |z] = 1,z +1, wreal and < 0. 


10. Dirichlet’s problem. The problem is to find a function harmonic (§10.1) in 
a given region taking assigned values at points of the frontier. The general 
problem is difficult and outside the scope of this book. The reader has the means 
of solving it in a disc with continuous values on the circumference. 

Use theorems 9.73, 9.74, 10.2 to find U(r, 9) = u(r cos 8, r sin 6) where u(x, y) 
is harmonic for x2+ y2 < 1 and, asr >1-—, U(r, 9) > f(@) where fis continuous 
for -7 <0 <7. 

Find U(r, 9) when f/(@) = 7 cos 480. 


NOTES ON CHAPTER 10 


For alternative contemporary accounts of complex function theory see the books 
of Ahlfors and Cartan. They start at the beginning and carry the subject further 
than we do in chapters 10-14. 


§10.1-10.2. The Cauchy-Riemann equations commemorate the two men who first 
showed the power of complex function theory. Cauchy’s exposition was purely 
analytical. Riemann, three decades later, gave full play to geometrical insight. 
Complex functions and functions from R* to R?. There is a natural one-to-one 
correspondence between the points of R? and those of Z. This induces a bijective 
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correspondence ¥ between the functions from R? to R?® and the complex 
functions (i.e. functions from Z to Z). Also, the usual metrics in R? and Z make 
these spaces indistinguishable as metric spaces. Hence, in ¥, continuous functions 
correspond to one another. 

As vector spaces, R? and Z differ; R? is two dimensional while Z is one dimen- 
sional. It is therefore not surprising that -A does not give a one-one corres- 
pondence between linear functions. In fact, if the complex function A+ im is 
linear, then the function (A, “)7 from R? to R? is linear; but, if (A, ~)7 is linear, 
A+ i is not necessarily linear. When 


Xx, y) = axt+ by, 
M(x, y) = cx+dy, 
then A+iu:Z > Z is linear if and only if 
a=d and b= —c. 


It follows that ¥ does not give a one-one correspondence between differentiable 
functions. If the complex function u+ iv is differentiable, then (u, v)? : R? > R? 
is differentiable; but if (u, v)” : R? > R? is differentiable, then u+iv is differen- 
tiable if and only if 
u,=v, and wy = —D,. 
It is this restriction that makes all the difference between the theory of different- 
iable functions from R? to R? and the theory of complex differentiable functions. 
The operators 6/0z, 6/6z. We express some arguments, e.g. §10.1, example 2, 
in terms of conjugates z, Z, but some writers use them more extensively than we 
do. It is possible to treat formally z, Z as independent variables, though in fact 
they are connected. On this basis, 


of _ 1 (af *) wee of 
éz 5 (Fe So eS ae 
The condition for regularity of f then takes the compact form 
of 
oie 0 


This usage is particularly helpful in dealing with functions of more than one 
complex variable. 


§10.4. The Riemann sphere. There is a geometric model in which all complex 
numbers, including 00, are depicted on a sphere. Let points z= x+iy be 
represented as usual in a plane Z. Through the origin O(z = 0) draw a line 
perpendicular to Z carrying a third coordinate u. Take the sphere S 


xe+ yi + (u—4)? = 4. 


Let N (the north pole) be the point (0, 0, 1) of the sphere; the tangent plane there 
is parallel to Z. 

Let Q be the point z = x+iy of the plane Z. Let QN cut the sphere in P. 
Then P shall be the point of S representing the number z. The point N (having no 
finite counterpart in Z) represents ©0, 

We verify that there is a metric defined on the sphere. We have NP.NQ =1. 
Suppose that Q,, Q. are points of Z representing z,, z, and that P,, P, are the 
points of S derived by projection from N. 
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Then the triangles NO, Q,, NP,P, are similar and therefore 


Q,0, NQ, NQ,.NQ,’ 


Hence ee 
|Z, — Z| {i+ |z,|7) * |Z2|2)}*" 
The chordal distance P,P, satisfies the conditions for a metric; let us call it 
P(Z1, Z2). 
If z, = 0, the corresponding formula is 
1 
NP, : 


NO, (1+ |al* 


The reader may verify that, if (zm, Zn) > 0 as m > ©, n > ©, there is a unique 
Zo (possibly 00) such that lim (Zs Zn) = 9. In fact, with the chordal metric, the 
extended plane Z is a complete metric space. 
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COMPLEX INTEGRALS. CAUCHY’S 
THEOREM 


11.1. Complex integrals 

The deeper-lying properties of complex functions were first 
brought to light by integrating the functions along curves in Z. 

We have already to hand in §8.1 and §8.2 definitions and theorems 
for curves and integrals in R”, and it was noted that they are applic- 
able to Z. It will now be helpful to restate some results and to prove 
new ones in notation which is most appropriate to complex functions. 

Given a path y represented by 


z=2)=x()+ip(t) (@<t <b) 


and a continuous function f = u+iv: G—Z, where G is an open 


set containing the trace of y, we define | f to be 
7 


[ few@zOde 


proving as in8.2 that the value is independent of the particular 
representation of y. The integral is the same as 


b b 
[. (« G09) aril ( ao “| dt, 
and it could be defined as such. 

The integrals with respect to ¢ are Riemann integrals of piecewise 
continuous functions. In proofs which follow, we can treat the 
integrands as continuous and omit any mention of the finite number 
of discontinuities that dx/dt and dy/dt may have; the proof would be 
completed by adding the results for the intervals of continuity. 

If z(a) = z(b), so that the initial and end points coincide, we call the 
path a circuit. 

If z(t,) + z(t.) for different ¢,, t, in a < t < b (except for a, D if 
the path is a circuit), that is to say there are no double points, the 
path is called simple. (A circuit is not assumed to be simple unless 
this is explicitly stated:) 

We shall prove in theorem 11.11 an inequality which is in constant 
use. 
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Lemma. If f is a complex-valued function defined ona < t < <b, then 


[roa < [isolar 


Proof. The simplest method is to approximate to the integrals by 
finite sums and use the known inequality 


> J(u? + 07). 
1 
The following alternative is more stylish. If « is a constant, 
b b b 
- le~* { 0) a s { re {e-i2 f(t)}dt < | Lf(O)| dt. 
b 


If « is chosen to be the phase of | f(t)dt, the first term in the line 


nm nr 
SUti yy) <S 
1 1 H 


above is 


[. f(datl. 


(if that integral is 0, then is not defined, but no proof is needed.) | 


Theorem 11.11. If |f| < M andy has length I, then 


< MI. 


Proof. - the lemma the left-hand side is at most 


ee a) *G)i*- 


From time to time we shall need theorems about complex integrals 
which extend results known for real integrals. The next theorem, on 
term by term integration, is an instance. 


from theorem 8.13. - 


Theorem 11.12. Let each function f,, be continuous in G. Let the 
sequence f,, converge uniformly to f on any compact subset of G. Then f 
is continuous in G and, for any path y in G, 


lel 


Proof. The continuity of f in G follows from theorem 5.21. If 
fn = Un tiv, and z = x(t)+iy(4) is a representation of y, then 


ee dx dy Oy > ge dy 
if =| (u, di on Gp) ati |, (v, Gi tty Gt) de 
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Theorem 5.22 is then applicable to the integrals on the right-hand 
side (the interval [a, b] being the union of pieces in which dx/dt and 
dy/dt are continuous). | 


11.2. Dependence of the integral on the path 


Suppose that fis continuous in a region D of the z-plane. If we 
take any two points Zo, z in D, it is important to know in what cir- 


cumstances | f has the same value for all paths in D joining z, to z. 


Y 
This is the same as asking whether | f round every circuit in D is 
zero, because, if y, and y, have the same initial and end points, y, —y. 


iS a circuit and = { — | ; 
yi ys Th V2 


Theorem 11.2. The value of | f is the same for all paths y in D joining 
y 


any two given points z, and z if and only if there is a function F regular 
in D such that F'(z) = f(z) for all z in D. 


Proof. Sufficiency. If y is the path z = z(t) for a < t < 5, then 
{ jS= | ’ Fey % dt 
5 - dt 
bd 
z | “5 Feha 


= F{z(b)}— Fi{z(a@)} 
= F(z)— F(Z). 


Necessity. Fixing Z), we note that | F(&)d¢ along any path in D 


defines uniquely a function F(z). 

If |h| is small enough, the straight segment (z, z+/) or A say is in 
D and 
F(z+h)- F(z) = | s 


Divide by A and let / tend to 0. Since f is continuous, we have 


F'(z) = f(z). | 


Theorem 11.2 enables us to calculate easy complex integrals by 
the technique of the indefinite integral familiar for real functions. 


A convenient notation is | f= [FI,. 
E 
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Example |. If n > 1, then, along any path joining Z, to Z,, 


21 
| z-1dz = (zf —z)/n. 
z 


0 


Proof. z"/n is regular for all z and has derivative z”-". | 


Example 2. Let y be the circle with centre a and radius r defined by 
z=atre’ (0 <@< 2n). 


Then, if 7 is a positive or negative integer, 


| (z—a)"“1dz =0 (n + 0), 
: dz 


27 
Proof. The integral is r” | en? ido. | 
0 


Example 2 is important. If n > 1, there is an indefinite integral 
(z—a)"/n regular for all z, as in example 1. If < 1, then (z—a)"/n is 
regular, not in the whole z-plane, but in the punctured plane 
0 < |z—al, and this suffices to give the result. 

If n = 0, and D is any region containing y (say the annulus 
r—06 < |z| < r+6), then there can be no function regular in D having 
derivative 1/(z—a). If there were such a function, the integral round 
y would be 0, whereas it is 277. 


Example 3. (Integration by parts.) If, in D, the functions ®, Y’ are 
regular and ©’, ”’ are continuous, then, for any y in D, 
(OV'+0'P) = [OF], 
Proof. Clear. 


11.3. Primitives and local primitives 


The noun primitive is often used in the same sense as indefinite 
integral in the calculus. It will be appropriate in this chapter. 
We always suppose that the function fis continuous in a region D. 


Definition. A function f has a primitive in D if there is a function F, 
regular in D, such that F'(z) = f(z) for all z in D. 


Definition. A function f has a local primitive in D if, given any point 
Z, in D, there is a disc with centre Z, in which f has a primitive. 
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It will be observed that the step from the first definition to the 
second is like that in §10.2 from regularity of a function in a region 
to regularity at a point. 

We shall need the following modification of the necessity part of 
theorem 11.2, in which restrictions will be placed on the region and 
on the paths of integration. For the detailed definition of the bound- 
ary of a rectangle, the reader may refer to theorem 8.31, corollary 3. 


Theorem 11.3. Let D be an open disc. Let | f = 0 whenever y is the 
TY 


boundary of a rectangle, with sides parallel to the real and imaginary 
axes, contained in D. Then f has a primitive in D. 


Proof. Let z,) be the centre and z any other point of D. The rect- 
angle, with sides parallel to the axes, whose opposite corners are at 
Zo, Z, lies in D. There are two paths from 2, to z, say 7,1, Y2, each along 


two sides of the rectangle. Then | f=| j= | fF = 0, from the 
v1 Ye 


Yi- V2 
hypothesis. Define F(z) to be the common value of { Jena} 7. 
1 Y2 


If h is real and small enough for the horizontal segment (z, z+/) 
to lie in D, then 


F(z+h)-F(2) = { : fe+tdt. 


Divide by h and let h > 0. Then, since fis continuous, 


OF 
5x J): 
Similarly 5 = if(2). 


If F(z) = U(x, y)+iV(x, y), these equations show that U,, U,, V,, Vy 
are continuous and satisfy the Cauchy-Riemann equations. By 
theorem 7.14, U and V are differentiable. By theorem 10.1, F is 
differentiable and F’(z) = f(z). | 


Exercises 11 (a) 
1. Calculate the integrals round the circle z= Re” (0 < 0 < 27) of x,y, 
Z|, xy. 


2. Calculate the integrals along the line segment from 1 to 2+ i of x, cos z, e’. 
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3. Evaluate the integrals round the unit circle (anticlockwise) of 

Zz, |z—1|, 2? (principal value), 

z~* (the branch continuous for 0 < 8 < 27 and equal to 1 for 6 = 0). 


4. Which of the following functions possess primitives in the disc B(1; 1)? 
(i) log z, (ii) sin(1—z), (iii) z+ (principal value). 


Give illustrations of functions which have local primitives in the punctured 
disc 0 < |z—1| < 1 but not primitives therein. 


5. Let Z, denote the plane Z from which the segment —1 < x < 1,y = 0 is 
deleted. Examine whether 
: lo (1 -) 
aaps ¥ 


6. Find primitives of the following functions, specifying the regions in which 
you claim the property: 


have primitives in Z,. 


1 
exp (z—- 1), 7244! tan Z. 


7. Let f be regular in a region D and y a circuit in D. Prove that tf ‘is a pure 
Y 


imaginary. (It may be assumed that /’ is continuous.) 


8. Let y be the semicircular path z = Re*®, where 0 < 0 < 7. Prove that, as 


R->o, 
aces Op) 
y z2—10 R 
Make an O estimate for = dz. 
Y 


9. (This is a useful fact e.g. for §14.1, example 2.) Let f be continuous in a disc 
punctured at z = a. Suppose that, as d>0, de’? f(a+ de**) > k uniformly for 
a<O0<Af. If yis the arcz = at+de%,a <6 < f, prove that 


lim | f= ik(P—«@). 
5-0 Jy 


10. State and prove the analogue of exercise 9, when Re*? f(Re*”) > k as R + 00, 
uniformly for « < 6 < Pandy is the arcz = Re”?,a<O0< f. 


11. Taking ie cae 
(P) f(z)dz (a real) 
a—-Ww 
to mean the principal value v 
lim fat iy)idy, 
You J-Y 


1+it0 
evaluate 3 (P) { a 


l-io @ 
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12. By comparing the integrals 
7 7 
[" aired, [" |fe%Ia0, 
<7 == 77 


N 
where f@) = 2 (—1)"a,2", 
n=1 
or otherwise, prove that 
| N an An 


N 
<7 > |a,|?, 
m,n=1M"—N n=1 


where the a’s are complex, and X’ means that terms with m = n are omitted in 
the summation. 


13. Prove that, if I is the image of the path y by the mapping w = f(z), then 


[ sondw = [ ofenseds 
I Y 


11.4. Cauchy’s theorem for a rectangle 


We now approach the grand theorem of complex function theory. 
It is normally stated as the vanishing of the integral of a function 7, 
regular in an appropriate region, round a circuit. 

We prove this first in a special case. 


Theorem 11.41, Let R be a closed rectangle with sides parallel to the 
axes and OR the circuit of its boundary. If f is regular in a region con- 
taining R, then 
f = 0. 
OR 


The circuit of the boundary of a rectangle with sides parallel to 
the axes is defined in theorem 8.31, corollary 3. 


ee 


into four rectangles of a quarter its size. The integral round @R is 
the sum of the integrals round the similarly oriented circuits of the 
four small rectangles, because the integrals along interior sides cancel 


Proof. Write I(R) = . Divide R by lines through its centre 


in pairs. 
We can therefore select one of the smaller rectangles, R, say, for 
oe I(Ry) > H(R). 


Now quadrisect R,, finding an R, for which 

AR PUR): 
Repeating the process of quadrisection we arrive at a rectangle R,, 
for which I( R,,) > 4-2 I( R). 
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There is, by exercise 3(d) 7, one point z) common to all the closed z=. 
Since f is regular at Zo, given ¢, there is 0 such that 


F(Z) = fo) +f’ (Zo) (Z — 20) + (2) (Z — 20) (11.41) 


where |e(z)| < € provided that |z—z,| < 4. 

If n is large enough, R,, lies inside the disc |z—2,| < 4. 

The first two terms on the right of (11.41) form a linear function 
of z and, from §11.2, example 1, their integral round @R,, is 0. 

On @R,, the modulus of the last term of (11.41) is less than 4es,,, 
where s, is the length of @R,, Hence /(R,) < 4es%. But s, = s/2", 
where s is the length of 0R. We deduce 


I(R) < 47I(R,) < 4° 4e(s/2")? = 4es°. 


Since ¢ is arbitrary, J(R) and hence { 720.4 
OR 


Note. If we assume the continuity of f’ instead of only its existence, 
Cauchy’s theorem is a straightforward consequence of Green’s 
theorem for a rectangle (8.31, corollary 3), 


[udx+ody) = ({(G-2) dx dy, 


where the double integral is taken over a rectangle and the line 
integral round its boundary. If f = u+iv is regular, then 


| faz ae [ udx—vdy) +i | (vdx+ud). 


The double integrals equivalent to these two line integrals are zero 
by the Cauchy—Riemann equations. 

In his original proof, published in 1825, Cauchy did in fact assume 
the continuity of f’. It was Goursat (1858- ~1936) who in 1900 first 
dispensed with this hypothesis. We shall see in §11.8 that it is vital 
to have a proof which does not rest on the continuity of f”. 


It will be useful later to know that the requirement in theorem 
11.41 of the regularity of f may be relaxed in specified sets of points. 


Theorem 11.42. Let f be regular in an open rectangle R and continuous 
on its closure R. Then 


fe 
OR 
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Proof. We may suppose without loss of generality that the origin 
is the centre of R. Let z = z(t) for O < ¢t < 1 bea representation of 
the circuit 0R. Let 0 < a < 1. Then the point z = az(f) traces out 
the boundary 2S of a rectangle S similar to R with linear dimensions 
reduced by the factor a. 

By theorem 11.41, 

f = 0, 
as 
Now 


| c fet icf! a | ; SeOj2'(Hat -[F {az(t)} oz’ (t) dt 


- | [ff()} faz} z(t + (1-2) | flax(}z'Od 


= p(a)+q(«), say. 


Since f is uniformly continuous in the compact set R, given e, we 
can find «, such that 


e(}—flac}| <€ if a > ap. 
Theorem 11.11 then gives |p(a)| < el, where / is the length of AR. 
So lg(@)| < (1a) Ml, 


where M = sup |f(z)| for z in R. 
Hence, as a > 1 (from below), 


| f-| fo. 

or Jas 
Therefore | 7 =o 1 

; OR 


We prove a corollary, which is a weak result compared with 
theorem 11.42 but nevertheless provides a useful extension of 
theorem 11.41. 


Corollary. If, in theorem 11.41, at a finite number of points of R, the 
function f is assumed continuous instead of regular, then 


f=0. 
aR 


Proof. Suppose that there is one exceptional point, say ¢ inside R. 
Through ¢ draw parallels to the sides of R dividing R into four 
rectangles. By theorem 11.42 the integral round the boundary of each 
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of the four is 0. Adding we have | f = 0. The corollary follows 
OR 


from a finite number of repetitions of the argument. | 


11.5. Cauchy’s theorem for circuits in a disc 


Theorem 11.5. Let f be regular in an open disc D, except possibly at a 
finite number of points at which it is continuous. Then, if y is any 


circuit in D, 
[reo 
Y 


Proof. Theorem 11.42 (corollary) shows that, for any closed 
rectangle R in D, 


f=), 
OR 


whether R contains exceptional points or not. 
By theorem 11.3, f has a primitive in D. 
By theorem 11.2, if y is any circuit in D, 


[r= 0.1 


The reader will realise that the circular shape of the region is 
inessential. It is possible to extend theorem 11.5, for instance, to a 
convex region (which contains the linear segment joining any two 
of its points). But such an extension has no air of finality. 

A typical example of an integral round a circuit which is not zero 
is (from §11.2, example 2) 


[2 = 20, 
yo 


where y is a circle described counterclockwise round the origin. The 
integrand is not regular at z = 0 and is, in fact, unbounded as z > 0. 

A survey of §11.1-11.5 may well lead the reader to conjecture 
correctly that, if f is regular in a region D, then J, f= 0 provided 
that the circuit y can be ‘continuously shrunk to a point’ in D. 

A necessary preamble to such a theorem is a short account of the 
topological notion of homotopy or the continuous deformation of 
curves. 
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11.6. Homotopy. The general Cauchy theorem 


Suppose that y,) and y, are two curves in a region D with the same 
initial point and the same end point. We choose a representation of 
each curve as a mapping y,(¢), y,(2) of the interval 0 < ¢ < 1 into D. 
(It will be plain that the particular representations chosen do not 
matter in what follows.) We have then 


y(0) = y,(0) and y,(1) = y,(1). 


We define y, and y, to be homotopic in D with fixed end-points if there 
is a continuous mapping z = d(t,u) of the square 0 <¢ < 1, 
0 < u < 1 into D such that d(t, u) satisfies 


6¢,0)=y7(4), (4D ="7 O<t< J), 
and 6(0, u) = y,(0) = y,(0) for all u, | 
8(1, u) = yo(1) = y,(1) for all u.] 


Thus, as uw increases from 0 to 1, 7, is continuously deformed into 
Y1- 

For integration we require y, and y, to be paths, but the mappings 
z = o(t, u) for 0 < u < 1 can be general curves. 

Suppose now that, instead of having fixed end points, each of y, 
and y, is a closed curve. If there is a continuous mapping z = d(t, u) 
satisfying for 0 < u < 1, instead of (11.61), 


6(0, u) = d(1, u) (11.62) 


(11.61) 


we call the paths y, and y,, homotopic as circuits. 
If, in particular, d(t, 0) is constant for 0 < t < 1, so that y, is a 
single point, we say that y, is homotopic to a point in D. 


Theorem 11.61. Let f be regular in a region D, except at a finite 
number of points at which it is continuous. Let yo, y be paths in D 
which are homotopic, 

either (i) with fixed end points 

or (i) as circuits. 


Then 7 tals 


Proof. Let 6 be a mapping of the squareR:0 <t<1,0<u<l, 
into D as prescribed in the definition of homotopy. 
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We can take dissect ons of the ¢, u intervals 
Geet, e242 eh ta ee E, & 4, 
Vem <u, =... 5 ty oy Ko < = 1, 


so fine that every rectangle 


Rij (ti St < tiga, Uy < UK Ujyy) 
is mapped by z = d(t, uw) into an open disc B,,; contained in D. 

If 0 <j <n, we replace the curve z = d(t, u;) by the polygonal 
path with vertices at z;; = d(t;, u;) for i = 0, 1, ..., m. Call this poly- 
gonal path y,; (0 < j < n). The paths y, and y, = y are given by 
z = O(t, uo) and z = d(t, u,). 

Write y,; for the arc of y; joining z,; to Z,,;,; and 7,,; for the linear 
segment joining z,; to Z;, 513. 

Then the segments 

Vigo Ni+1,9 Vigra 7 Vai 


form a circuit lying in the disc B,,. 
Integrating the function fround this circuit, we have from theorem 


1 es: 
[ s-[ s+] s-] seo. 
Vij Yi,j+1 Ni+1,5 Ni 


Summing this for 0 < i < m—1, we obtain 


J ig a e +s oe 


Now sum for 0 <j < 2-1. Under hypothesis () %; and , ; 
both reduce to points. Under (ii) the paths %,; and 7,,,; are the same. 


In either case we have 
| t- | ‘f=0.| 


Simply connected regions. Intuitively a simply connected region is 
one without holes in it. This idea can be formalized in terms of 
homotopy. 


Definition. A region D is simply connected if every closed curve in D 
is homotopic to a point in D. 

An alternative definition is that any two curves in D with the same 
fixed initial and end points are homotopic. We assume the equiva- 
lence of the two definitions. 
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Illustrations. The disc |z| < 1 is simply connected: the annulus 1 < |z| < 2 
is not—the circuit z = re#(0 < @ < 27), where 1 < r < 2, cannot be shrunk 
to a point. Nor is the punctured disc 0 < |z| < 1 simply connected. 

If yo in theorem 11.61 is a point, we have the most useful of all 
versions of the theorem of Cauchy. It is the one to which we shall 
attach his name. 


Cauchy’s theorem 11,62. Let f be regular in a region D, except at a 
finite number of points at which it is continuous. Then, for any circuit y 
homotopic to a point in D, 
{ 7 =20. 
y 


If D is simply connected, the integral is 0 for any circuit in D. 


11.7. The index of a circuit for a point 


Let y be a circuit and a a point not on y. We seek an analytical 
definition of the intuitive idea of the number of times that y winds 
round a. The reader is reminded that a circuit may have double 
points: with a complicated circuit care is needed in counting the 
turns. 


Definition. The index of y for a is defined by the equation 
1 dz 
MY, a) = x a 


Theorem 11.7. The index is an integer. 


Proof. The following proof has the advantage of not resting on 
the geometrical notion of angle. 
Let y be z = z(t) for a < t < f. Then 2(f) = z(a). Consider the 


function 
fe Se 
ht) = oer u 


It is continuous for « < t < f#. At every point of continuity of z’(¢), 
the function A(t) has the derivative 


ee 
h'(t) = ae 
Hence the derivative of e~*{z(t)—a} is zero except possibly for a 
finite number of values of t. Therefore e~{z(7) — a}, being continuous, 
is constant. Equating its values for « and f, we have e’ = 1 and 
so h(f) is an integral multiple of 277i. | 
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Properties of the index n(y, a). 

(1) For fixed a, n(y, a) is unaltered if the circuit y is continuously 
deformed without passing through a. 

This is a special case of theorem 11.61. 

(2) For fixed y, n(y, a) is constant - values of a in a disc containing 
no point of y. 

To prove this, let y, be the path y translated through the vector —A. 
The position of a+h relative to y is the same as that of a relative to 


Yn. Therefore ny, a+h) = nly; 4). 


If |h| is small enough, by (1) the right-hand side is equal to n(y, a). 
(3) Suppose that D is simply connected, a is not in D and y is a 
circuit in D. Then n(y, a) = 0. 
Theorem 11.62 shows that the integral defining n(y, a) 1s 0. 
(4) If y is the circle z = e° (0 < @ < 2n), then 
nty,a) =1> for jal <1, 
nya) = 0 for jal > 1. 
If |a| < 1, we can by (2) take a = 0. We showed in §11.2, example 


2 that dz 
{ — = 2ni. 

Z 

If |a| > 1, we appeal to (3). | 


From (4) the circular path y defined by z = z)+re“(0 < ¢ < 27) 
determines two regions. The first D,, containing all points a for 
which |a—z,| > 7, is unbounded. The index n(y, a) = 0 for every 
point of D,. In the other region D,, consisting of points a such that 
|a—z,| < r, we have n(y, a) = 1 and n(—y, a) = —1. We then say 
that y is positively oriented and —y is negatively oriented. 

If y is a simple circuit of a different shape, say a rectangle, a direct 
proof could be constructed that again y determines two regions in 
one of which n(y, a) = 0 and in the other n(y, a) = +1 (according 
to the orientation). In applications of complex function theory a 
circuit is always an easily defined curve, e.g. one made up of straight 
segments and circular arcs. We shall speak of its outside (points a for 
which n(y, a) = 0) and its inside (points a for which n(y, a) = 1 ify 
is positively oriented). 

There is a general theorem that any simple closed curve divides 
the plane into two regions, the outside of the curve and the inside. 
This topological result, Jordan’s curve theorem, is harder to prove. 
(There is a proof in Newman’s Topology of Plane Sets of Points.) 
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Exercises 11(d) 


1. Adapt the proof of theorem 11.41 from a rectangle to a triangle, and hence to 
a simple polygon. 


2. Prove that theorem 11.41 holds if R contains a finite number of points ¢€ at 
which /f, though not regular, satisfies the condition lim (z— ¢) f(z) = 0. (Suppose 
2—>€ 


that there is one point ¢. Divide R into nine rectangles, the central one Ry con- 
taining ¢; make Ry small). 


Observe that, in the illustration = — = 27i in §11.5, the integrand 1/z is just 
too large as z > 0 to satisfy the requirement in the preceding paragraph. 
3. Comment on the following suggested proof of theorem 11.41. 

Taking OR as the circuit of integration, and 0 < A < 1, write 


I(A) = { f(Az) dz. 


Then I’(A) = ; gf(Andz = | | ~ | = dz 


Therefore EG) andso J(A) = A 


_/ 
A 
Letting A > 0, we have A = 0. Now put A = 1. 


4. Compute directly the index of the circuit formed by x = +a,y = +a for 
the origin. 


11.8. Cauchy’s integral formula 

The following is a far-reaching consequence of Cauchy’s theorem 
11.62. 
Theorem 11.81. Let f be regular in a region D. Let y be a circuit in D 
which is homotopic to a point. Let a be any point of D not lying on y. 


Then 
nty, a) fla) = 57, | LEP de 


Proof. In D define 
ge) - OO +2, 
g(a) = f(a). 


Then g is regular except at z = a, and g is continuous at a. Therefore 
(theorem 11.62) 


0% [ soa =| Le) - [ 22 ae 


In the last term use the definition (§11.7) of the index n(y, a). | 
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Corollary 1. In most applications y is a simple circuit winding once 
round a (e.g. the circle z—a = re” for 0 < @ < 2m). If the circuit is 
positively oriented, the formula is 


f(z) 
Corollary 2. Suppose that f is regular in the annulus R < |z| < S. 
Let B, C be circles with centre z = 0 and radii r, s, described counter- 
clockwise, where 


== 


Rr = a <5 <5. 


<a fa) = 2, f $a Jf £2 a 


271i JoZ— o Dai 


First proof. Define g(z) as in theorem 11.81. The circle B can be 
continuously deformed into C via the paths z = pe“ (0 < t < 2zn), 
where r < p < s. By theorem 11.62 


| gdz - | gdz 
C B 


and the proof is completed as in theorem 11.81. 

Second proof. Draw a common diameter of the circles B, C, not 
passing through a. 

Apply theorem 11.81 in turn to the 
two simple circuits, one indicated by 
continuous arrows and the other by 
dotted arrows. Add. The two integrals 
along each straight segment cancel. | 

Cauchy’s integral formula shows oe 
that, if the value of a regular function 
is known at every point of a simple 
circuit y homotopic to a point, then its 
value is determined at every point 
inside y. 

In succeeding theorems, in which a will be variable, our usual 
notation will be to replace a by z, and to denote the variable of 
integration by ¢. 

We observe that the only property of f required for the integral in 


theorem 11.81 to have a meaning is continuity on y. This remark 
suggests the next theorem. 
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Theorem 11.82. Let $(€) be continuous in an open set containing a 
path y in the €-plane. If z is not on y, define 


pte) = f 2a 
Then f'(2) = "na 


Moreover f'(z) is continuous. 
Observe that, in contrast with theorem 11.81, the path y is not in 
general closed. 


Proof. We prove from the definition that f’(z) is obtained by 
differentiation under the integral sign. 

Suppose that |¢(¢)| < M on y, that y has length / and that d is 
small enough for the disc with centre z and radius d to contain no 
point of y. From the definition of /, 

fiz+h)-f@) _ | H(Odé 
h y(S—z)(G—z—A) 
Therefore : 
fEth-f@_( Ode _ a x PEE 
h y(€-z)? (Se 
Take |h| < 4d and use theorem 11.11. The modulus of the right- 
hand side is less than \A| M1 


Let h > 0 and we have the integral formula for f’(z). 


To prove now the continuity of f’ for any z not on y we again 
estimate a bound of the relevant integral. If |h| < 4d, 


re+h-1'@ =| |e —p- awl HOM 


1 1 h 
= | leceaa* Fa} Cee Oe 
and the modulus of this is less than 


eM 


which tends to 0 as h tends to 0. | 
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11.9. Successive derivatives of a regular function 


Combining theorems 11.81 and 11.82 we see that, if fis regular in 
D, then f’ is continuous. Much more than this is true: 

A regular function has derivatives aS every order and they are all 
regular functions. 

We must prove this fact (which will be formally stated as theorem 
11.91). The reader may divine correctly that the argument of theorem 
11.82 by which f’ was proved continuous could be refined to prove 


that 
nr 9'@) = 3 | oO at 


A direct extension, needing induction, to f(z) on these lines is 
laborious. The following is an alternative proof, depending on 
integration by parts. 


Theorem 11.91. Let f be regular in a region D. Then it has a derivative 
of every order and every derivative is regular in D. Also, if y is any 
circuit in D homotopic to a point, and z is in D but not ony, then 


ny, af) = 3 | 


271 


Proof. There are four stages (i)-(iv) in the proof. 
(i) We shall prove f’ continuous in D. Given Z, in D, let 7, be the 


path § - Zotre’ (0 <t < 2n), 
where B(z,; 2r) < D. Then, if oa < r, by theorem 11.82, 


SE 
Oe Fai »(S— zp 4s 


Also by theorem 11.82, f’ is continuous inside yy a SO at Zp. 
(ii) Next we prove the stronger result that f’ is regular in D. 
Since f’ is continuous we can integrate the last equation by parts 
(§11.2, example 3) and obtain 


r= 2,f Oa 


ni 
By theorem 11.82, 


2ni J y,(¢—Z)? 


fO=s| Xa 


if z is inside Y, and so for z = Zp. 
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(iii) From (ii), the derivative of any function regular in D is 
regular in D. Therefore the derivative of f of every order exists and 
is regular in D. | 

(iv) Let now y be any circuit in D homotopic to a point, and z 
any point of D not lying on y. Then, by theorem 11.81, for any 


integer r, ©) 
nt, af) = 52 [ FO ae 


Integrating the right-hand side by parts r times, we have completed 
the proof. | 


We emphasize again that in most applications y is a positively 
oriented simple circuit enclosing z, so that n(y, z) = 1. 

Morera (1856-1909) proved in 1886 a theorem converse to 
Cauchy’s, namely that if the integral of a continuous function round 
certain circuits vanishes, then the function is regular. 


Theorem 11.92. (Morera.) Suppose that f is continuous in a region D. 
Let z, be an arbitrary point of D. Suppose that there exists p such that 


| f = 0 round the boundary y of any rectangle, with sides parallel to 
y 
the axes, contained in B(z,; p). Then f is regular in D. 


Proof. By theorem 11.3, fhas a primitive, say F, in B(Z); ) which 
is regular in that disc. By theorem 11.91, the derivative f of F is 
regular in B(z); p), in particular at z 9, which is any point of D. | 


In conclusion, the reader should satisfy himself that he can relate 
each of the implications in the following summary to the theorem in 
which it has been proved. 

Write: R for the statement f is regular in D; L for the statement f 
has a local primitive in D; P for the statement f has a primitive in D; 


I for the statement | f = 0 for every circuit y in D. 
¥. 


Then, with no restriction on the region D, 
[l=P=>Le_rR. 


If D is simply connected, then P = L. 

Proofs without Cauchy’s theorem? Suppose that f is regular in D, 
that is to say it has a derivative f’. Using Cauchy’s integral formula 
we proved as a first step that f’ is continuous and then established 
the existence of derivatives of all orders. Naturally the reader will 
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ask whether so apparently innocent a result as the continuity of /’ 
can be proved more directly. It seems to have nothing to do with 
integration; can it not be proved without recourse to Cauchy’s 
theorem? It can, and indeed the existence of all the derivatives can 
be proved in a few pages by a more searching analysis of the mapping 
by w = f(z) of a region in the z-plane onto a region in the w-plane. 
But this was achieved only in 1960; references are given in the notes 
on Chapter 11. In 1958 a leading topological analyst, G. T. Whyburn, 
could still write that a direct proof of the continuity of f’ ‘remains 
inaccessible at present by these methods’. 


Exercises 11 (c) 
1. Give the values of the integrals 


z* e? 
[ pe dz, [ ae ae dz, 


where y is the circuit 
Y z=atre® © <6 <2n). 


2. What are the possible values of 
Se 
irc 


for paths joining 0 to z (not passing through +i)? 


3. What are the possible values of 
[ dz 
y 2(1 — 27)’ 


4. Prove that, if y is a circuit having index 1 for the origin, 


: 3 eae x” exp zx 
n}; may, At" 


where y is a simple circuit? 


dz. 


Deduce that : 
co x” 1 2n 
pS (=) —_— i exp (2x cos 4) dé. 
0 


nie ae 


5. Suppose that fis regular in the disc B(O; 1+ 6). If |z| < 1, show that 
1 1-—¢Z 
(-|2)9/@) = 5 : SOe— 


where C is the positively oriented unit circle. Deduce the inequality 


2n 
(= l29 FI <5- |" eI 
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6. (Poisson’s formula.) Let fbe regular when |z| < R,. Prove that, if |a] < R < R,, 


then R°—aa 
I, @—a)\(R— za) OO” 


where C is the circle z = Re‘? (0 < ¢ < 27). Deduce Poisson’s formula that, 
fr <R, 


yews 


271i 


fire) = 


2n R?—r2 
fee ees eos es! be te Sees & ip 
= A R S(Re*”) dd. 


—2Rr cos (6—4) +r? 


7. Let f be regular for |z| < R. Prove that, if0 <r < R, 


2n 
FO) ==, |" uO) exp (ia, 
where u(A) = re f(r exp i). 


8. Taking, in theorem 11.82, y to be the unit circle described positively, find £ 
when ¢ is (i) 1/€, (ii) re €. 


9. The function fis regular in the upper half plane y > 0, and continuous on the 
real axis y = 0. Also f(z) = o(|z|) as zo. Prove that, if y > 0, 


fe)=7 tim | fOg 

WXSeo J-xX (§—x)*?+y? 
10. Let / be the segment a < x < hb, y = 0. Let D,, D, be regions each having /as 
part of its frontier, D, being in the half-plane y > 0 and D, in the half-plane 
y < 0. Let f, be regular in D, and continuous in D, U /; f, regular in D, and con- 


tinuous in D,U/. Let f, = fy at all points of J. Prove that f, and fg define a 
function regular in D, U/U D,. (Use Morera’s theorem 11.92.) 


11. (Schwarz’s Reflection Principle.) Suppose in exercise 10 that D, is the mirror 
image of D, in the line y = 0. Let f, be regular in D, and continuous in D, Ul. 
Moreover let f, be real valued at all points of /. If then f, is defined in D, by 


f.(z) = conjugate of f), 
prove that f, and f, define a function regular in D, U/U D,. 


NOTES ON CHAPTER 11 


§§ 11.1-11.3 could be written in the notation of differential forms. There is now 
an extensive theory of these forms which is of service both in analysis and 
mathematical physics. A good introduction is given by Harley Flanders, 
Differential Forms with Applications to the Physical Sciences. 

With two real variables x, y, a differential form of the first order is, 


w = Pdx+Qdy, 
where P, Q are continuous functions of (x, y) in a region D. If y is a path 


x = x(t), y = y(t), for a’< t < b then | w means 
y 


b 
2 [Pix@), ¥O}x’O) + Ox), VO} y’'Ol dt. 
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A differential form which has a primitive in D is usually called exact. A form 
which has a local primitive (i.e., is locally exact) is called closed. We have 
avoided this use of the overworked word closed. 


§§11.4-11.6. Cauchy’s theorem. Cauchy published his theorem in 1825 in his 
‘Mémoire sur les intégrales définies prises entre les limites imaginaires’. There is 
evidence that he knew the result at least ten years earlier. Ensuing discussions by 
writers previous to Goursat (1883) rested on the equality of a line integral 
f(udx + vdy) along two paths from one point to another in virtue of the Cauchy— 
Riemann equations. Such an approach could not be carried through without 
restricting f’ to be continuous. 

Goursat relied on the method of quadrisection, as used in theorem 11.41. A 
region with a curved boundary was thereby decomposed into whole squares 
and part squares with curvilinear boundaries. In 1900 Goursat achieved a 
proof assuming only the existence of f’ and not its continuity. In his approach 
from the interior of the region to the boundary he used the device, employed in 
theorem 11.42, of a family of curves z = «z(t) where O< a< 1 anda-—l. 
The success of this method depends on the region being starlike in the following 
sense. 

Definition. The region R is starlike if it contains a point z, such that, if z 
is any other point of R, the segment [ZZ] is wholly in R. 

The ultimate goal of generality in Cauchy’ s theorem at which analysts in 
the early years of the 20th century could aim was that f should be regular in a 
region bounded by a rectifiable curve. Such a region need not be starlike and it 
would be difficult to frame rules for dissecting it into starlike sub-regions. So 
Goursat’s procedure was not final. 

Pringsheim (1901) initiated a method of proof which was widely adopted. The 
quadrisection argument establishes the theorem for a triangle and hence for a 
simple polygon (exercise 11(b), 1). The step from a simple polygon to one which 
may intersect itself is not difficult intuitively. Finally a polygon can be con- 
structed to approximate arbitrarily closely to a given rectifiable curve. The 
difficulties in such a programme lie in the framing of instructions for reducing a 
polygon with self-intersections to simple polygons and defining polygons 
approximating to the curved circuit of integration. 

A proof, on principles derived from both Goursat and Pringsheim, was 
published by S. Pollard (Proc. London Math. Soc. 21 (1923) 456-82). Later, 
T. Estermann distilled the proof of the theorem for a general rectifiable boundary 
into a few pages of elegant analysis (Math. Zeitschrift 37 (1933) 556-60 and 
~ 38 (1934) 641.) 

Following rapid advances in the subject of topology from (say) 1920, topo- 
logical rigour demanded recognition besides analytical rigour. To illustrate that 
new requirement, we recall that the index of a circuit y for a point ¢ a (§11.7) 
came to be defined as ; ? 

Z 


° 3 
2m Jy Z—a 


whereas previously it had been apprehended from a diagram as ‘the number of 
turns that y takes around a’. 
The essence of Cauchy’s theorem came to be seen as the invariance of the 


integral [ f for displacements of the curve y. The topologists had, ready to hand, 


Y 
the theories of homology and homotopy which dealt with such displacements. 
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§§ 11.6-11.7. Topology. M. H. A. Newman, Elements of the Topology of Plane 
Sets of Points gives a good account of ideas and methods, with special attention 
to the parts needed in the theory of functions. 


§11.9. For proofs of the existence of higher derivatives of a regular function | 
which make no appeal to Cauchy’s theorem, see E. H. Connell, Duke Math. 
Journal 28 (1961), 73-81; A. H. Read, Journal London Math. Soc. 36 (1961), 
345-352; G. T. Whyburn. Fundamenta Math. 50 (1962), 305-318. 


12 
EXPANSIONS. SINGULARITIES. RESIDUES 


12.1. Taylor’s series. Uniqueness of regular functions 


A multitude of properties of regular functions flows from Cauchy’s 
theorem. We prove first that a function regular in a disc can be 
written as a Taylor’s series. The expression of a regular function in a 
simple standard form, such as a power series, points the way to 
further results. 


Theorem 12.11. Let f be regular in the disc B(a; R), that is |z—a| < R. 
Then, for any z in the disc, 


f(z) = fla) +(e-a)f'(a)+...+ =O" foray +. 


Proof. Without loss of generality we can take a = 0. Given z, 
choose r such that |z| < r < R. Let C be the positively oriented 
ercie € = re“ (0 < & < 2m): 

Then, by theorem 11.81, 


ZT = 
fe) = 52, [pee 
In the integral substitute 
l J a eats 


ee ot , 
2 Se ee 
This gives, by use of theorem 11.91, 
F(Z) = fO+2f' OF... + FO) + Res 


n+1 
where | ee = (Z) fe dé. 
When ¢ is on C, |z/¢| = |z|/r < 1, and sup | f(¢)| = M, say. Then 
by theorem 11.11, Ral < K(\z|/y"4, 
where K = Mr/(r—|z|). Hence R, ~O0asn—> oo. | 


Notes. (1) The proof amounts to integrating term by term the 
series obtained by inserting the binomial expansion of 1/(¢—z) in the 
integral. This series converges uniformly for ¢ on C, and we could have 
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appealed to theorem 11.12, the complex extension of theorem 5.22. 
Instead we have estimated the remainder R,, from first principles. 

(2) Contrast theorem 12.11 with the real-variable theorem leading 
to Taylor’s series (C1, 101) in which every derivative has to be 
postulated explicitly and its magnitude controlled to make sure that 
the remainder R, tends to 0. With a complex function all this is 
implicit in the sole assumption of regularity. 

(3) If we are given a power series 


Ajta,(z—a)+...+a,(z—a)"+..., 


convergent for |z—a| < R, we proved in theorem 10.2 that the sum 
is regular for |z—a| < R. Theorem 10.2 (in particular note 2 and 
exercise 10(a), 2) shows that, if f(z) is the sum, then 


a, = f{™@|n! 
Definition of a zero. Suppose f regular in B(a; R). If f(a) = 0, then 
a is called a zero of f. If f(a) is the derivative of lowest order which 
is different from 0, then a is called a zero of order n. (If every derivative 


of fis 0 at a, then, by theorem 12.11, f(z) = 0.) 
Observe that, if fhas a zero of order n at a, then 


F(Z) = @—-4)"g(Z) 
where g is regular at a and g(a) + 0. 

Moreover there is a disc with centre a, say B(a; 6), throughout 
which the continuous function g is different from 0, so that a is the 
only zero of fin B(a; 6). Thus the zeros of finite order of a regular 
function f are isolated. 

We now investigate regular functions defined not necessarily in a 
disc but in an arbitrarily given region. 


Theorem 12.12. If f is regular in a region D and the set of zeros of f 
has a limit point in D, then f(z) = 0 in D. 7 


Proof. Let a be a limit point in D of the set S of zeros of f. By 
theorem 12.11, fhas a Taylor expansion in any disc T with centre a 
which lies in D. 

If any derivative f(a) were different from 0, there would be a 
circle with centre a containing no zeros of f other than a. But this 
contradicts the vanishing of fin S. So f(z) = 0 for all z in the disc T. 

We extend this result throughout D by the following construction 
of overlapping discs. Let b be any point of D; we wish to prove 
f(b) = 0. Join a to b by a polygon P in D (theorem 3.32). Let d be 
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the distance between the sets P and D’. Then the disc of radius d 
with centre at any point of P lies in D. Take points a) (= @), @, dy, ... 
a, (= b) in order on P with each distance |a,—a,_,| less than d. Take 
a disc T. of radius d with centre at each point a,. Then a, is in T,_;. 
If f = 0 in T,_,, then, by the first part of the proof, since a, is the 
limit of zeros of f, we have f = 0 in T,. Hence f(b) = 0. | 


The following restatement of theorem 12.12 is to be noted. 


Theorem 12.13. (Uniqueness theorem for regular functions.) A func- 
tion f, regular in D, is uniquely determined if we know its values at 
points of a set S having a limit point in D (a fortiori, if S is a linear 
segment or arc, however short, in D). 

This follows from theorem 12.12 because, if fand g are two regular 
functions having the assigned values in S, then /—g = 0in D. 


12.2. Inequalities for coefficients. Liouville’s theorem 
The following simple inequalities for the coefficients in a Taylor’s 
series are due to Cauchy. 


ese 12.21. ffi 5 az" for \z| < R, and M(r) = sup | f(z) | 


for |z| =r < R, then - Mo) 
| a,| < = rn : 
Proof. From theorems 12.11 an 3) and 11.91, 
Be he, 0) IS) ge 
n = 5 OC gui 1% 
where C is the circle z = re*®(0 < @ < 2z). 
By theorem 11.11, 1 es 


2nr. | 


|2,,| < on pnt 


Definition. A function regular for all z in Z is called entire. 
Some writers call an entire function an integral function. 


Illustrations. Any polynomial, exp z, sin Z, Cos z. 


Theorem 12.22. (Liouville (1809-1882).) An entire function, which 
is bounded, is constant. 


Proof. The function has a Taylor expansion, say 


J = ay” 
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valid for all z. Suppose that | f(z)| < M. From Cauchy’s inequalities 
(theorem 12.21) M 
|a,,| < pat 


Since r is arbitrarily large, a, = Oforn > 1. | 


Liouville’s theorem has many applications. As an illustration we 
deduce from it ‘the fundamental theorem of algebra’. 


Theorem 12.23. Let p be a polynomial (not constant). Then there are 
values of z for which p(z) = 


Proof. If the conclusion is false, 1/p is regular for all z. Now 
| P(z)| - -> 00 as |z| — 00 (exercise 10(5), 11). So we can find a (closed) 
disc B outside which |1/p(z)| is bounded. But |1/p(z)| being con- 
tinuous is bounded on B. So 1/p is a bounded entire function. By 
Liouville’s theorem it is constant. This is a contradiction. | 


Exercises 12 (a) 


In each of 1-8, expand the given function in a Taylor’s series =a,,z". Find, if you 
can, the general coefficient a,, or, failing that, the first three non-zero coefficients. 
In each example state the radius of convergence of the series. 


1. The principal value of g +z)*, where k can be complex. 
: log cosh z. 
. (arc sin z)?, defined to be 0 when z = 0. 


. tan z. 


aA kb Ww N 


. (cos z)*, defined to be +1 when z = 0. 


oe 
e*+]- 


z 
7. [ e-* dC. 
0 


[e.@) gn 


1 1-—z" 
9. Expand in Taylor’s series = a,(z— 47)", 
cot z,- cosec* 2. 
10. If Xa,z" has radius of convergence 2, find the radii of convergence of 
eae alge age 
X(ak + ak +e .. ae )Z", 


where k is a fixed positive integer and in the fourth series a, > 0. 
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11. For each set of values (i)-(v) corresponding to z = 1, 4, 4, 4, 4, Z, ..., find 
whether a function exists, regular at z = 0, taking those values: 


(i) i. 0, ‘ 0, $5 0, bie 4 
d— 4, vi, v5 Nas N35 V6 oes 


(iii) 4,4 = Z, 4, z; $, cee9 


12. Prove that the distance from the origin to the nearest zero of f(z) = Janz” 
0 
is at least r |ao| 
M+ |ap|’ 
where r is any number not exceeding the radius of convergence of the series and 
M = M(r) = sup |f(z)| for |z| = r. 


13. In a region D, f and g are regular and fg = 0 at all points. Prove that 
f=0org=0. 


14. Prove that, if f(z) = Xa,z" is regular for |z| < R, then the average value 
of | f(z)|? on |z| = r, where r < R, is X|a,|?r?". 
Deduce theorem 12.21. 


15. Prove that, if for n = k Cauchy’s inequality (theorem 12.21) is an equality, 
that is 


then f(z) is a constant multiple of z*. 


16. Let f(z) = {a,2" be regular for |z| < 1+06. Prove that the polynomial p,(z) 
0 
of degree k which minimizes the ease 


] a 2 
ae ke 9) — p,(e*”)|2 dO 


k 
is p:(z) = >)a,z". Prove that the minimum value is > |a,,|?. 
0 k+1 


17. Give an alternative proof of Liouville’s theorem 12.22 by integrating 
f(z)/{(z—a)(z— b)} round a circle whose radius tends to infinity. 


18. The function / is entire. 
(i) Prove or refute the suggestion that, to prove f constant, boundedness may 
be replaced by the hypothesis 


|[f(2)| = o(|z|) as z>o, 
(ii) What conclusion follows if we assume 
|f(z)| = O(\z|*) as z>00? 


19. Prove that, if f = u+iv is regular in Z and either (i) u 2 O or (i) v < 7 for 
all z, then fis constant. Include (i) and (ii) in a general statement. 


~ 
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20. Replace the hypothesis (i) or (ii) of 19 by wv > 0. 


21. The function fis regular in Z. There is a disc B(wy; 6) such that no value f(z) 
is in B(wo; 0). Prove that fis constant. 


22. The functions f, g are regular and have no zero in a disc B(zy; rr). There is a 
sequence Z, > Zp at which f’/f = g’/g. What follows? 


12.3 Laurent’s series 


Suppose that fis regular in an annulus with centre a (instead of 
throughout a disc). Following Laurent (1813-1854) we expand f(z) 
in a Series es both positive and negative a of z—a. 


A series > Cn, 18 Said to converge if, and only if, : c, and > Gz, 


=o 


both converge. 
Theorem 12.31. Let f be regular in the annulus R < |z—al| < S. 
Then, for any z in the annulus, 
fe) = 3 a,(e—a)", 
Pee oe oe 
eee ity a -q)nt aac 


and Cis the circle € = a+pe”(0 < @ < 2n), p being any number such 
thatR<p</S. 


where 


Proof. As in theorem 12.11, we can without loss of generality take 
a = 0. 
Given z such that R < |z| < S, let C,, C, be circles, with centre 
€ = 0 and radii r, s, where 
Rexel Ss ee, 
By theorem 11.81, ween om 


fa) = 3, {| LOa AO 


Qni - 55 eee 
where C, and C, are positively oriented. 


As in theorem 12.11, the —— round C, yields the series S auz* 
where = F($) S) ae 


On mi o, Ener aS 


- To deal with the integral round C,, we expand 1/(€—z) in powers of 
¢/z (since |f| < |z|), and obtain 
1 1 4 hye eg 


=z Zz zt gett ee 7)" 
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An argument like that of theorem 12.11 shows that the remainder 
term in the integral round C, tends to 0 as m — oo, and the consequent 
infinite series is }} a_,z~”, with the value of a,, already defined. 

1 


Since the integrand in the formula for a,, is regular for R < |¢| < S, 
the restriction r < |z| < s on the radii of the circles C,, C, can be 
removed (by theorem 11.61(ii)) and we can integrate round any 
circle C in the annulus. | 


Corollary. (Cauchy’s inequalities.) If f(z) = & a,(z—a)” for 
R < |z-a| < S 
and | f(z)| < M(r) for |z-—a| =r, where R < r < S, then, for every 
integer n, positive or negative, 
M(r) 
— 


The proof follows that of theorem 12.21. 


lan] < 


Theorem 12.32. (Uniqueness of Laurent expansion.) 
(i) Iff@) = X a,(@—a)" and f(Z) = & 5,(2—a)” for 


R < |z-a| < S, 
then a,, = b,, for all n. 
(ii) If f is regular for R < |z—a| < S, then f = f,+fy, where f, is 
regular for |z—a\| < S and f, is regular for R < |z—a|. Moreover if 
Fz > 0 as |z| > ©, the expression of f as f, +f, is unique. 


Proof. (i) If we fix p with R < p < S, each of the two series for 
F(€) converges uniformly on the circle €—a = pe“’(0 < @ < 27), by 
theorem 5.41 extended to cover negative powers of ¢—a. 

It is then legitimate to multiply by (€—a)-*! and integrate term 
by term round the circuit (—a = pe’’. Hence 


mig, = 2nib, om O21, + 2, ...). 

(ii) f(2) = La,(z—a)" and f(z) = ¥ a,(z—a)" satisfy the con- 
ditions suet To prove uniqueness, ae that also f = g,4+2>. 
aa h=fi-g (\e-al < 9), 

= 82-fz (|z—a| > R) 


is regular for all z and (since h +0 as |z| > ©) is bounded. By 
Liouville’s theorem 12.22, h is constant and (from |z| > 00) is 0. | 
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Exercises 12(b) 
1. Write down the Laurent expansions of 


appropriate to the respective regions |z| < 1, 1 < |z| <2, 2 < |z|. 
Expand similarly 
Loe a Le 
(z—1)(z—2)° z(z—1)(z—2)’  (22+1) (22+ 4) 
2. Expand in positive or negative powers of z 
— 
1+z+22° 
3. Prove that, for all finite z except 0, 


[e @) 
cos {u(z+z—1)} = } c,2", 


— © 


47 
where Cy = : { cos (2u cos 9) cos 2nd dé. 
0 


4. Establish the expansion 
exp {4u(z—z7)} = DJ, (u)2", 


7 
where J <u) = * ) cos (nO —u sin 0) dé. 
0 
Prove also that, if n > 0, 
[o.e) ( mee 1)*u"t 2k 
Inu) = Zi snitke nth! 
while, if 7 is a negative integer — m, 


J(u) = In{—U). 


5. Discuss the possibility of expanding in power series 


{e-DG-2F 


in a suitable circle or annulus. 


6. The same question for log = 
loa) 

7. The regular function w= > a,2" 
— 


is a bijective mapping of the closed ring r; < |z| < r, onto a compact set S of 
the w-plane. Prove that the area (or content) of S is 


lo @) 
7 > n\a,|2(r2” — r2”). 
ce 
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12.4. Singularities 


Suppose that f is regular in the punctured disc 0 < |z—al < R. 
We ask whether, with the right choice of the value of f at a, the 
function thus extended is regular in the whole disc |z—a| < R. 


Theorem 12.4. A sufficient (and necessary) condition that f, regular 
for 0 < |z—a| < R, can be extended to a function regular in 
|z—a| < R is that 

(z—a)f(z) 70 as z—>a. 


Proof. There is a Laurent expansion 
f(z) = YD a,(z—a)” for 0 < |z-al< R. 


Let M(r) = sup |f(z)| for |z—a| =r. 
Then, from the hypothesis, 
rM(r)>0 as r->0. 
By theorem 12.21 (corollary), 
a= on Tt. 


Therefore f= ST. 


If f(a) is defined to be dp, then f(z) = > a,(z—a)" for |z—a| < R 
0 


and this is regular. | 


Definition. f is singular (or, has a singularity) at a if it is not regular 
at a. 


Definition. If f is singular at a and regular in a punctured disc centre 
a, the singularity at a is isolated. 

If the condition of theorem 12.4 is fulfilled, we can assign a value 
to f(a) so that f is regular at a. Such a point a may be called a 
removable singularity of f. We generally suppose that removable 
singularities have been removed. 


Illustration. We shall say, without more ado, that the function 
1—cos 2z 
z2 


is regular for all z, on the understanding that the value 2 is assigned to it for 
gs 8. 
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The condition (z—a)f(z) > 0 certainly holds if f is known to be 
continuous at a. In that case, there is, in fact, no singularity at a 
and it was only lack of knowledge that previously stopped us from 
asserting that fis regular at a. The clause in theorem 11.61 permitting 
a finite number of points of continuity rather than regularity was 
therefore only temporarily useful and now loses its significance. 

If a is an isolated singularity of f, there is a Laurent expansion in a 
punctured disc, say 


fle) = S a(z—a)". 


Definition. If the Laurent expansion of f has only a finite number of 


negative powers, say > a_,(z—a)~”, where a_,, + 0, then a is called 
ss | 


a pole of order m of f and > a_,(z—a)~ is its singular (or principal) 
H 


part. 
A pole of order one is called a simple pole; one of order two a 
double pole. 


Definition. If there are infinitely many negative powers of (z—a) in 
the Laurent expansion, then a is called an essential singularity of f. 


> a,(z—a)~™ is the singular part of fat a. 
1 


Illustration. exp (1/z) has an essential singularity at z = 0. 


Before §13.3 we shall be concerned only with isolated singularities. 
We shall there encounter functions having singularities which are 
limit points of singularities. 


Definition. A function which is regular in a region except at poles is 
called meromorphic. 


12.5. Residues 


We suppose that f has an isolated singularity at a and the usual 
Laurent expansion. 


Definition. The coefficient a_, of (z—a)™ is called the residue of f 
at the singularity a and is written res(f, a). 
From the definition of a_, (theorem 12.31) 


27ia_, _ [fs 
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where C is a positively oriented circle containing the singularity a 
and no other. 7 

This formula for a complex integral round a suitable circuit in 
terms of the residue of the integrand at a singularity constitutes a 
powerful method of calculating definite integrals. The next theorem 
caters for any number of singularities. 


Theorem 12.5. (Residues.) Let f be regular in a region D except for 
isolated singularities c (which form a countable set C). Let y be a 
circuit in D which is homotopic to a point in D and is free of points 
of C. Then the index of y is zero for all c except a finite number and 


[ f= 2m > ny, c) res (f, c). 


Proof. Let K be a compact set in D. Then the set Kn C is finite, 
for if not K would contain a limit point of C which would be a non- 
isolated singularity. 

We proceed to define the particular K which is used in the proof. 
The definition of homotopy requires a continuous function 


6:[0, 1]x[0,1] > D 


such that 6(0,u) = 6(1,u) for O<u <1, O¢,0) for O<t<l 
represents y and d(t, 1) (0 < t < 1) is a point curve Yo (ie. is con- 
stant). Since [0, 1] x [0, 1] is compact, so is 


K = 6({0, 1] x [0, 1). 
From §11.7 (1), if b is any point of Z—K, 
n(y, 5) = no, 5) = 9. 


Let C, be the set of singularities of fin K and C, those in D—K. 
We have proved that C, is finite and that n(y, c) = 0 at every point 
of the countable set C,. 

We shall prove that D, = D-—C, is a region. 

Let z € D,. Then z cannot be a limit point of C, for, if it were, z 
would be a non-isolated singularity of f. Therefore there is a 
B(z;¢) < D—C, and so D, is open. To prove now that D, is 
connected, let z,, z, be two points of it. There is a polygon in D 
joining z,, Z,. Its trace is compact and so it can go through only a 
finite number of points of C,. Each such point, being isolated, can be 
avoided by a small detour. 
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In D,, fis regular except at the points of C,, say c1, Co, ..., €,<‘Luet 
S(/, c;,) be the singular part of fat c,(k = 1,...,p) and, for z in D,, put 


(2) = fe)- 3 SUF cv) 


Then g is regular in D,—C, and at each c, it has a removable 
singularity. By theorem 11.62, 


[ s=o. 
T 


We shall prove that, at a point c(= c,) of Cy, 


| SC, c) = 2nin(y, c) res Cf, c). 


Since c is not on y, there is a disc with centre c containing no 
point of y. The power series in 1/(z—c) _ 


Sf, c) r 9 a7 > 


converges uniformly on y. Hence 


| SU, ¢c) = > a, i (z—c)-"dz., 
¥ 1 Y 
Since (z—c)~” has, for n > 2, a primitive (z—c)-"*1/(—n+1), it 


follows that i (z—c)-"dz = 0 for n > 2. 
Y 


Finally, a_, | (z—c)—1dz = 27i n(y, c) res (f, c), from the definitions 
z 


of index and residue. 
From the definition of g and the vanishing of its integral on y, 


| ; y= 1TH = n(Y, C;,) res (fy C;). 


Since n(y, c) = 0 if c € C,, the right-hand side is 
dni > ny, c) res(f, c). | 
ceC 


12.6. Counting zeros and poles 


Suppose that / has a. zero of order /at a. Then, from the definition 
of a zero (§12.1), there is a disc B(a; 6) in which 


F(Z) = (2-a)'g(2), 
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where g(z) is regular and not zero. In this disc 


F(Z) = Kz—a)"g(z) + (2 — a)'g"(2). 


Hence, for 0 < |z—al| < 6, 


£0 © 1-20 
Jay? 2-e. gy 
where g’/g is regular in B(a; 6). 

Therefore f has a simple pole at z = a with residue /. 

Similarly, if f has a pole of order m at a, then f’/f has a simple 
pole there with residue —m. 

The following theorem which enumerates the excess of the number 
of zeros over the number of poles of a function meromorphic in a 
region is particularly useful when the function is regular and we are 
counting its zeros. 


Theorem 12.61. If {( + 0) is meromorphic in the region D and y is a 
Positively oriented simple circuit in D, passing through no zero or pole 
of f, homotopic to a ee in D, then 


Ia 
- I ee eae 
where N is the number of zeros and P the number of poles of f enclosed 
by y (a zero or pole of order | being counted I times). 


Proof. The index n(y, z) is 1 for points z enclosed by y and 0 for 
other points. The theorem follows from theorem 12.5 and the first 
two paragraphs of §12.6. | 


Corollary. If g is regular in D, then 
ef Ee) 
ani | Oe ee 
summed over the zeros a; and the poles b, of f enclosed by y. 
Adapt the argument of the theorem. | 


In applying theorem 12.61 we observe that 


ed) eon ae 
ni Jy f(z) ta dri Jr w’ 
where I corresponds to y by the mapping w = f(z) from the z-plane 


to the w-plane (exercise 11(a), 13). The w-integral is the index n(T, 0) 
of I’ for the origin. 


394 EXPANSIONS, SINGULARITIES, RESIDUES [12.6 

The reader should work out for himself the application of this 
principle to numerical examples. The following exercise is typical. 
Exercise 12 (c), 0. Find how many roots of the equation 


— 378+ 7z2—9z+6 =0 
lie in each quadrant. 
The solution is at the end of the book. 


Here is another result which is useful in root-counting. 


Theorem 12.62. (Rouché (1832-1910).) Suppose that f and g are 
regular in D. Let y be a simple circuit, homotopic to a point in D. Tf, 
for every z on y, |g(z)| < |f(2)|, then y encloses the same numter of 


zeros of f as of f+g. 
Proof. Since |g| < |f|, neither f nor f+g has a zero on y. The 


point w = 1 422) Z) for z on y, lies in the disc B(1; 1). By §11.7 (3), 


fi)’ | 
if [is the w-image of y, then 
l dw _ 
2ni Jr Ww me ©. 
lee fe 
ee a mee < 5 ae 


Integrate this round y, and divide by 27i. The theorem then 
follows from theorem 12.61. | 


A quick deduction from Rouché’s theorem is: A polynomial of 
degree n has n zeros. 


Proof. Let {@) = 2,20) = "4. +4. 
If y is a circle, centre 0 and radius r, where 
r” > |a,|r?41+...+|a,|, 

the polynomial f+g of degree n has the same number of zeros inside 
y as z” has. But z” has n zeros at z = 0. | 

A numerical illustration of Rouché’s theorem follows. 

Example. Show that the roots of the equation 

z°>—12z7+14 =0 

lie between the circles |z| = 1 and |z| = 3. How many lie inside the 
circle |z| = 2? 
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Solution. On |z| = 3, take f = 25, g = —12z?+14. 


Then lg| < 75414 < 3125/32 = |f|. 


So f+g = Ohas the same number of roots inside |z| = 3 asf = 0, 


namely 5. 
On |z|.= 1, take f = 14, g = 2°— 122%. 
On |z| = 2, take f= —122°,g = 2°+14. 


Exercises 12(c) 
1. Evaluate the integrals round the unit circle, described positively, of 
1 
24(z— 2*(z— 2) 


2. Prove that, for all sufficiently large R, the integral of (z—a)-"(z—b)-” round 
the circle |z| = R is 0. 


aie 1 
sn, er, 
Zz 


3. Prove that z8+3z?+7z+5 has two zeros in the first quadrant. How many 
zeros are in |z| < 4? 


4. Prove that, if a and 5 are positive, the equation 
z++az+b=0 


has one and only one root in the first quadrant, and that its phase is greater 
than i77. 


5. Prove that z°+ 3z*— 5z?+ 10z?—1 has three zeros with rez > 0. 
Prove that it has two zeros in the ring + < |z| < 1. 


6. Prove that the equation e** = z” has n— 1 roots inside the unit circle if n > 2. 


7. Prove that the function sinh 7z takes every value wy such that re wy > 0 
exactly once in the region rez > 0, —4 < imz < }. 


8. Find the number of zeros of z(z2"+1)+1 having —7/n < phz < a/n. 
9. Prove that, if |a| < 1, then 


a ( —— )’ a 
1—az 
10. Prove that the function sin z—z has just two simple zeros in each of the 
- 
— (Qn—4)7 < rez < (2n+4)7, 


has m+n zeros in |z| < 1. 


where n is an integer other than 0; and that, apart from the triple zero at z = 0, 
these are its only zeros. 


11. Prove that, however small 6 is, all the zeros of the function 


Leta gt. aes 
az" niz” 


lie in the disc B(O; 4) if x is large enough. 


396 EXPANSIONS, SINGULARITIES, RESIDUES [12.7 


12.7. The value z = © 


In §§10.4, 10.5, we paraded the advantage of adjoining a point at 
infinity, z = 00, to the complex plane Z. The treatment of the 
bilinear transformation was neater in the extended plane Z. In 
chapters 11 and 12 (so far) the value oo has been left on the shelf, for 
a good reason. Everything has been based on integration, and the 
inclusion of z = © in a path of integration makes its length infinite 
and entails an additional limiting operation. 

We assume that fis defined for all (finite) z if |z| is sufficiently large. 


Definitions. f(z) is said to have an isolated singularity for z = 00 if 
f(1/z) has an isolated singularity for z = 0. Moreover the former is a 
pole or an essential singularity according as the latter is. 

f(z) is said to be regular at © if f(z) is regular for |\z| > r and 
bien Ff(1/z) is finite. (By theorem 12.4, the latter condition holds if 


lim zf(1/z) = 0.) 


Illustration. At z = oo, the function z? has a double pole and sin (1/z) a 
simple zero; exp z has an essential singularity. 


We can now state Liouville’s theorem 12.22 in a more cogent 
form. 
A function regular in Z is a constant. 


Proof. From the definition of regularity at oo, there are constants 
A, 9 such that |f(z)| < A if 0 < |1/z| < 7. Now, regularity of f for 
all finite z shows that |/f(z)| < B if |z| < 1/7. Theorem 12.22 gives 
the result. | 


Another useful result, extending Liouville’s, follows. 


Theorem 12.71. If f is meromorphic in Z, then f is a rational function 
p/q, where p and q are polynomials. 


Proof. The number of poles of fis finite, since an infinite set would 
have a limit point either at z = a (finite) or z = «©. f would have no 
Laurent expansion about this limit point and it would not be a 
regular point or a pole of f- 

Let ay, do, ..., @, be the finite values of z which are poles of f/, and 


let 
S;,(2), S2(Z), eeny S,(Z) 
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be the respective singular parts of fat these poles. If z = 00 isa pale, 
the singular part there is of the form 


S.(Z) = Byz +... + Bey Z™. 
Then f(@)-¥ S,(2)- Sal2) 


has no singularities for any z in Z. Therefore by the form of Liou- 
ville’s theorem just stated, it is a constant A. So 


fore Ae 3 S.(z)+S.(2) 


which is a rational function. | 


In further developments the best setting will be sometimes Z and 
sometimes Z. In a discussion of singularities of a function, the 
behaviour at oo should be specified. 

We shall not use the residue of a function at oo. The definition 
(which needs care) is in a note at the end of the chapter. 


12.8. Behaviour near singularities 


Suppose that a is an isolated singularity of f. 

If a is a pole of order n, the definition shows that | f(z)| tends to 
infinity as z tends to a, and its order of magnitude is given by 
f(z) ~ a_,(z—a)™. In contrast, the behaviour of f(z) as z approaches 
an isolated essential singularity is erratic. 


Theorem 12.8. (Casorati-Weierstrass.) Let a be an isolated essential 
singularity of f. Given any complex number b and arbitrarily small 
positive numbers r, there exists z in the punctured disc 


0 < |z-al <r 
such that | [f(2)-5| < e. 


(At points arbitrarily near to a the function takes values arbitrarily 
near to any given number.) 


Proof. If thereisazin0O < |z—a| < rwhere/(z) = 5, the theorem 
is proved. If not, write : 


ae 22 


Then g is regular in 0 < |z—al <~r. 
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If g is bounded, then a is a removable singularity of g and 
f = 6+(1/g) has a removable singularity or a pole at a. In fact f has 
an essential singularity at a. Therefore g is unbounded and there is a 
z for which |g(z)| > I/e. | 


Illustration. exp (1/z) assumes all values except 0 arbitrarily near to 
f= 0, 


Proof. Write exp (1/z) = b = pe™ say (p > 0). 
Then z-1 = log p+i(a+2nm). 

To make |z| < r, we have only to take the integer n large enough 
to make (log p)?+(a+2n7)? > r-*, | 

When a = o the results of this section are of daily relevance. If 
f(z) is a polynomial Pi eae 
where a,, + 0, then oo is a pole of order zn. 

If f(z) is an infinite series in powers of z (or z—Z,) convergent for 
all z, then f is an entire function and oo is an isolated essential 
singularity. 

Entire functions, a class which includes some of the most useful 
functions of analysis like expz and sinz, show the Casorati-— 
Weierstrass behaviour outside any circle |z| = R. The function exp z 
actually takes all values except 0 for arbitrarily large |z|. 


Exercises 12(d) 
In each of 1-8, state for what values in Z the function is singular and specify the 
nature of the singularity 
z+1 z-1 42? 


_ z=1 2+1 22 


e 


z4 


° 1423" 
ef 
1+2°° 
4. tan z. 


5. tanh z. 


6. exp (tan Z). 


ae 
7. sin ce 
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a ae a 
9. Prove that the entire function cos z assumes all values for arbitrarily large 
|z|. (cos z has no exceptional value such as 0 is for exp z.) 


NOTES ON CHAPTER 12 


§12.7. Definition of residue of f at «. If f has a Laurent expansion valid for 
|z| > R, and a_, is the coefficient of 1/z in it, then —a@_, is defined to be the 
residue of f at ©. 

We have come to expect the occurrence of residues only at singularities. This 
definition assigning, as it does, a residue to the function 1/z which is regular at 00 
seems perverse. The rationale of it is that the role of the residue of fat a point Z» 
is to pick out the part of the function which has a non-zero integral along a 


[o.) 
circuit round z,. For instance, the residue at z = 0 of >) a,z” (with sum f(z) for 


— oo 
1 
i 271 [ oP 

where C is (say}a positively oriented circle with centre z = 0. Positive orientation 
is equivalent to the point 0 lying on the left as the circuit is traversed. Now a 
circle |z| = R is a circuit round z = 00, and, to keep oo on the left, it must be 
traversed clockwise. (The Riemann spherical representation §10.4 illustrates 
this.) Thus we arrive at the definition of the residue at 00 as being —a_,. 

The introduction of the residue at 00 would have the following consequence. 

If a function has a finite set S of singularities in Z, the sum of the residues at 
points of S, with oo included, is zero. 


0 < |z|) is a_, and 


§ 12.8. Some writers attribute theorem 12.8 to Sochozki. 

Picard’s theorem. Theorem 12.8 shows that an entire function (other than a 
polynomial) assumes values arbitrarily close to any assigned number. We noted 
that the special function exp z is, by choice of z, equal to any assigned number, 
except 0, and does not merely take values near to it. This is a general truth for 
entire functions, discovered by E. Picard (1856-1941). His original theorern 
(1879) was that a function meromorphic in Z which omits more than two values 
is a constant. The proof depended on assuming, on the contrary, three values to 
be omitted, projecting them into 0, 1, co and then drawing on knowledge which 
was already available about a class of functions (elliptic modular functions) 
which fail to take just those three values. Picard’s theorem led to a vast literature 
in which the names of Landau, Schottky, Bloch, R. Nevanlinna and F. Nevan- 
linna are prominent. For a text-book account of the subject (which is impossible 
for us to broach) see, for instance, Hille, Analytic Function Theory, vol. t. 
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GENERAL THEOREMS. ANALYTIC 
EUNCTIONS 


13.1. Regular functions represented by series or integrals 


The outlook in this chapter is wider than in chapter 12, and some 
of the results are more general than are needed in every-day analysis. 
The reader who is primarily interested in applying his knowledge 
to special functions can read chapter 14 with occasional reference 
back to the present chapter. 

Taylor’s theorem 12.11 exhibited a regular function as the limit 
of a sequence of polynomials. Laurent’s theorem 12.31 allowed 
negative as well as positive powers of the variable. The present 
section contains representation theorems in a much wider setting. The 
first, due to Weierstrass, replaces Taylor’s series in powers of z by a 
uniformly convergent sequence of functions on which the sole 
restriction is regularity. 


Theorem 13.11. Suppose that (i) for each n, s,(z) is regular in the 
region D; and (ii) on every compact subset of D, s,(z) tends uniformly 
to s(z) as n tends to «©. Then (1) s(z) is regular in D, (2) the derivative 
of any order p satisfies sz) = lim siP)(z). 


Proof. Let a be any point of D. Choose r small enough for the 
closure of the disc B(a;7r) to lie in D. Then s(z), the uniform limit 
of the sequence of continuous functions s,,(z), is continuous in B(a;7r), 
and so at every point of D. 

We shall use Morera’s theorem 11.92 to prove that s(z) is regular. 
Let y be any circuit in B(a@;r). Since s,(z) tends uniformly to s(z) 
on y, we have, from theorem 11.12, 


| Sale) de > | se) dz, 


For each n, the left-hand side is 0 (by Cauchy’s theorem 11.5). 
So the right-hand side is 0 and, by Morera’s theorem, s(z) is regular 
in D. 

To prove (2), let C = C(a;r) be the circular boundary of B(a; r), 
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positively oriented. Then, since s,(¢) — s(¢) uniformly for ¢ on or 
and (€—a)-?-1 is bounded on C, 


(0) (0) 
el ee eae lage qr A 


Multiplying by (p!)/27i, we have (2).. | 


Corollary 1. The convergence of s‘?(z) to sz) is uniform on every 
compact subset of D. 


Proof. We refine the proof of (2). Let z be any point in the closure 
of the disc B(a; 4r). Then 


oe oes ee at, 


aan tote aye 
since now |€—z| > 4r. This gives 
lim sz) = sz) 
uniformly on the closure of the disc B(a; 47), that is to say, given e¢, 
|sP(z)—s®(z)| <e if n> N(e,a) and ze B(a;4r). 


Let now K be a given compact subset of D. By the Heine—Borel 
theorem 3.7, a finite number of points a can be found such that the 
(open) discs B(a; 4r) associated with them cover K. 

If N is the greatest of the corresponding numbers N(e, a) defined _ 
above, we have 


|s?(z)-—sP(z)| <e if n>N, forallzin kX. | 


Corollary 2. The hypothesis (i) of the theorem may be replaced by 
(1)’ s,(z) is regular in D,, where D, | Dys,(n = 1,2, ...) and 
Pp =1)D... 


Proof. At each stage of the proof of the theorem we are working 


with a compact set, say K. If K < U D,, then, by the Heine-Borel 
; 


N 
theorem: 3.7, K < U D,..Since. D,, —_D,,., we have K < Dy. We 
1 


ignore those s, for which n < N and argue as in theorem 13.11. | 
Illustration. SAZ) = aa a 


is regular in D,, the disc |z| < 2-1/". s(z) = zin|z| < 1. This example is given by 
Ahlfors, Complex Analysis (p. 174). 
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Theorem 13.11, defining a regular function as the limit of a 
sequence (or as the sum of an infinite series) will have a parallel in 
which summation over the positive integers is replaced by integration 
with respect to a continuous variable u. We suppose u to be real, as 
it usually is in practice; the extension to a complex variable and a 
prescribed path of integration is straightforward. 

We state two theorems about the regularity of a function repre- 
sented by an integral, in the first over a finite interval and in the 
second (the analogue of theorem 13.11) over an infinite interval. 


Theorem 13.12. Let s(u, z) be a continuous function of the pair (u, Z) 
forO0 <u < 1,zinaregion D. Let s(u, z) be regular in D for each u. 


Then | 1 
I(2) = F s(u, z)du 


is regular in D, and its derivatives are given by 
e 1 9p 7 
J?) = I a7 S(u, Z) du. 


Proof. Since s is continuous, fis continuous in D. 

As in theorem 13.11, given a disc in D we take in it a circuit 
with a representation z = z(t). Then, since the integrand is continuous 
in (u, Z) and the integrals along y are real integrals with respect to ¢, 
it can be seen from theorem 8.31 that 


i du [ su, dz = { ; dz { s(u, Z) du. 


The left-hand side is 0, since, by Cauchy’s theorem 11.5, the inner 
integral is 0 for each u. Therefore the right-hand side, namely 


| f(z) dz, is also 0. By Morera’s theorem 11.92, fis regular in D. 
Y 


To prove the formula for f®, let C be a circle with centre z lying 
in D, From theorem 11.91 


i - s(u, C)du. 


The integrand of the repeated integral is continuous in (u, €) and 
we may invert the order of integration, obtaining 


19 
f(z -| = eT s(u, z)du. | 
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Theorem 13.13. Let s(u, z) be a continuous function of the pair (u, Z) 
for u > 0 and for all z in D. Let s(u, z) be regular in D for each u. 


Suppose that, on every compact subset of D, | 5 s(u, z)du converges 
uniformly to f(z). Then f(z) is regular in D and 
f®X(z) = 2 set s(u, Z) du. 
9 ozP 
Proof. The only modification needed in the proof of theorem 
13.12 is to appeal to uniformity of convergence of |, ” s(u, z)du to 
assert the continuity of f and to justify inverting the repeated yea gy 


in which it occurs. | 


An alternative extension of theorem 13.12 caters for unbounded- 
ness of the integrand s(u, z), say as u->0+, instead of integration 
of s(u, z) over an infinite range [0, 0) in theorem 13.13. 


Exercises 13 (a) 


1. Investigate in what regions of Z (if any), the following limits of sequences (or 
sums of series) represent regular functions: 


@ lim , a (i) lim Z, Gi) lim =p 
"A001 @ HEA) be 
ci : sin nz (viii) 3 sin nz 


2 ee as in 1 these integrals: 


{= @ fp) 53 ay f° a 
(i) {, Meee (ii) {, exp (— zt*) dt, (iii) ‘i exp (— zt”) dt, 
1 ° co e 1 ° 
ot ae () { SaaS (vi) { oe dh 
"eee a ee a ae 
(vii) [ alee (viii) lim | om, 
1 t é T-+0o J-T at+it : 


3. Functiens f, g ate given which are regular in Z. From 1 (i) or otherwise 
construct a sequence ¢,, such that 


fight 2 al <1, 


ee pe if |z| > 1. 
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4. Show that, for every fixed N, 


= 1 
Fu(@) = Get 2) (41) 


converges uniformly on the annulus 1/N < |z| < N. Deduce that F,(z) is regular 
except for certain singularities, and state the nature of the singularities. 


co 

5, In theorem 13.13 prove that the convergence of i ?s(u, z)/éz?du to f(z) 
0 

is uniform on every compact subset of D. 


6. Apropos the remark following theorem 5.62, give an example of a function 
of z continuous in a domain D which is not the uniform limit of a sequence of 
polynomials in z. 


13.2. Local mappings 


The results of chapter 12, particularly §12.6, enable us to analyse 
the local properties of the correspondence effected by w = f(z). We 
studied simple examples in chapter 10. Throughout this section we 
shall assume that fis not a constant. - 


Theorem 13.21. Let f be regular at a, and let f(z)—b have a zero of 
order n for z = a. Then, if € is small enough, there is a corresponding 
6 such that, for every b’ in the disc |b—b'| < 4, the disc |z—a| < € 
contains n zeros of f(z)—0’. 


Proof. The zeros of f being isolated (§12.1), we may suppose ¢ 
small enough for a to be the sole zero of the regular function f—b 
in the closed disc |z—a] < «. 

We can take 6 > 0 such that 


If@)-b| > 8 
for every z on the circle |z—a| = «. 
Let b’ satisfy |[b—b'| < 0. 
By Rouche’s theorem 12.62, 
f(z)-b and {f(z)—b}4+ {6-5} 
have the same number of zeros inside the circle |z—a| = ¢. The 
former has n zeros and therefore so has f(z)—)b’. | 


Corollary 1. A function regular on an open set G and not constant on 
any of its components maps open subsets of G onto open sets. 


Proof. With the notation of the theorem, if fis regular at a, and 
f(a) = b, the z-disc B(a; €) is mapped onto a w-set containing the 
disc B(b; 6). | 
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Corollary 2. If fis regular at a and f'(a) + 0, there is a neighbourhood 
of a in which w = f(z) has an inverse mapping regular at b = f(a). 


Proof. Since f'(a) + 0, we have n = 1 in theorem 13.21. By con- 
tinuity of f’, we can take the e in the theorem small enough for /’ to 
be different from 0 in the disc B(@; €). 

There is a bijective mapping of B(b; 6) onto an open subset G of 
B(a; €) containing a. The function f restricted to G has an inverse 
g and, by corollary 1 and theorem 3.23, g is continuous. 

Given b, and w, in the disc B(b; 4), there are unique a, and z, in 
G with g(b,) = a, and g(w,) = z,. Then 

g(w,) —g(b,) 2447, 
-b, fe) f(a)’ 
Let w, > b,. Since g is continuous, z, > a, and the right-hand side 


tends to 1/f’(a,), since f’(a,) + 0. Hence g’(b,) exists. Since 5, is an 
arbitrary point in the disc B(d; 6), g is regular at b. | 


Theorem 13.21 yields a basic proof of the Maximum Principle 
(which is already contained in exercise 12(a), 14, a deduction from 
Cauchy’s integral formula. See exercise 13(5) 7(i).) 


Theorem 13.22. (The maximum principle.) If f is regular in a region 
D (and not constant), then |f\ does not have a maximum in D. 


Proof. If a is a point of D, and b = f(a), then any disc A (in D) 
with centre a is mapped onto an open set G containing b. G contains 
points with modulus greater than |b]. | 


Corollary. If f is regular at every point of a compact set E, then 
| f| attains its supremum at a frontier point of E. 


The maximum principle has many consequences. As Haeneiiens 
we append theorems 13.23 and 13.24. 


Theorem 13.23. (Schwarz’s lemma.) Suppose that f is regular and 
| f(z)| < M for |z| < R, and that f{(0) = 0. Then either 


|f(2)| < sl | jor 0.< |zL.« 2 


or Fiz) = ve ze (a real). 


Proof. We may take M = R = 1, and the general case will then 
follow by applying that result to f(Rz)/M. 
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Write g(z) = f(z)/z for z+0 and g(0) = /'(0). 

From theorem 12.4, g is regular for |z| < 1. 

Given any point Z) with |zo| < 1, let |z)| < r < 1. Then there is 
at least one value z, with |z,| = 7 such that 


vA 
IgZo)| < |g) = vel Mg! se 
This is true, however near r is to 1, and therefore 


|g(Zo) | <1 


If there is a € such that |g(¢)| = 1, then, by theorem 13.22, g is 
constant, say g(z) = A, where |A| = 1, and f(z) = Az. If there is no 
such ¢, then |g(z)| < 1 for |z| < 1, that is to say |f(z)| < |z| for 
C= ise 1] 


Corollary. With the hypothesis of the theorem either ~ 
|f’O) | < M/R or f(z) = a ett 


Proof. In the theorem, either |g(0)| <1 or g(z) = A, where 
|A| = 1. From this the inequality for | f’(0)| follows. | 


The next theorem gives finality to §10.5 (8). 


Theorem 13.24. Every bijective conformal mapping of a disc onto 
another is bilinear. 


Proof. We lose no generality by taking the discs to be |z| < 1 and 
|\w| < 1. 

(i) We prove first that, if z = 0 is mapped into w = 0, the mapping 
is w = ze (a real). 

Denote the mapping by w = /(z) and its inverse by z = g(w). 

Then if2)| <1 if [zl <1. 

By theorem 13.23 (corollary), 

either (/(0)] < 1. of {@) = ce™. 
Similarly ei < f° or iy = we. 


Since f'(O)g'(0) = 
we must take the second alternatives. 
(ii) Let the mapping w = L(z) transform |z| < 1 onto |w| < 1 


13.2] LOCAL MAPPINGS 407 
From §10.5 (8) the transformation 


Wo 


= 10) = Ft 


maps |w| < 1 onto |¢] < 1. 
The compound transformation ¢ = T{L(z)} maps the disc |z| < 1 
onto |¢| < 1 with z = 0 mapped into ¢ = 0. By (i) of this proof, 


TiL@)} = ze™. 
Therefore Ei) = Tze) 


“> We 76" 
1 i= Wo ze’ 
which is bilinear. | 


Exercises 13 (5) 
M(r) always denotes sup | f(z)| for |z| = r. 


1. Calculate M(r) for the functions 
j Tagez: me 2 (—1)"z* 
(i) exp iz, (ii) sin z, (iti) It eee 


2. Prove the minimum principle: if f is regular and not constant in a region D, 
then |f| can have a minimum in D only at a zero of f. 


3. If f is regular in B(O; R), where R > 1 and M(1) < 1, prove that there is 
just one point z, in B(O; 1) for which f(z9) = Zp. 


4. If fis a polynomial of degree n, prove that 
M(r;) _ Mr.) 


ry ry 


for 0 < r, < rg, with the sign of equality only if f(z) = az”. 


5. f is regular and bounded in B(O; 1). As r->1-—, f(re*%) +0 uniformly for 
a<0c«€ ff. Prove that f= 0. 


6. The three-circles theorem of Hadamard (1865-1963). Let f be regular for 
ry <|z| < r,. By applying the maximum principle to z?{f(z)}4 for suitable p, g, 
prove that, for r; < r < rs, 


M(r) 08 /t) < M(r,)108n M (rq) '08 'r9, 


Verify also that log M(r) is a convex function of log r. (A function is convex if 
the arc joining any two points is below (or coincides with) the chord.) 
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7. Let f be regular in B(O; R) and write 
Qn 
lin) = 50 | reIdd O <r < B 
20 0 


(See also exercise 12(a), 14.) Prove that 
(i) J,(r) is a continuous, increasing function of r, 
(ii) |f(O)|)? < L(r) < {M(r)}, and deduce theorem 13.22, 
(iii) log J,(r) is a convex function of log r. 


13.3. The Weierstrass approach. Analytic continuation 


We now look more closely into the foundations of complex 
function theory. The development in which the integral theorem 
{ fdz = 0 takes the central place is due to Cauchy. Other avenues 
of approach were followed by Riemann and by Weierstrass. As a 
prologue to the rest of this chapter the reader should look again at 
the discussion in §10.2 of the two values of w arising from w? = z 
and at the uniqueness theorem 12.13 for regular functions. 

Weierstrass developed complex function theory using power series 
as the standard representation of a regular function (in the circle of 
convergence). It may help the reader if the general discussion is 
accompanied by examples chosen to be as simple as possible. 

Suppose that we are given a regular function / defined in a region 
D. An immediate question is—can the domain of definition of f be 
extended to points outside D? 

Keeping theorem 12.13 in mind, let us agree on terminology and 
state some facts. 


Definition. A regular function f defined in a region D will be called 
a function element, written (f, D). 

Suppose that (fo, Do) and (f;, D,) are function elements, where Dy 
and D, have common points. Then Dy D, is open; without restrict- 
ions on D, and D, it need not be connected. If Dy and D, are 
convex, as we shall always in future assume, Dy / D, is connected. 

Suppose that f(z) = A(z) for all zin Dyn D,. 

Then (f, D,) U D,), where f = fo in Dy and/ = f, in D,, is a function 
element. 


Definition. With the assumptions just made, each of (fo, Do), (f1, D1) 
is called a direct analytic continuation of the other. 
It may be possible to go on with this process and form a chain of 


function elements | 
fas Dy), Ci Dj), aieeelS Sus D,) 
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of which each (after the first) is a direct analytic continuation of the 
preceding one (each D convex). 


Definition. Any two function elements of such a chain are called 
analytic continuations of each other. 


Example 1. 
f(z) = 14+(01-24+(1-2)?4+...4+01-2)"+ 


D, is the disc |1—z| < 1. 

This example is artificially simple because the power series defining 
fo is a geometric progression which, for z in Dy, can at once be 
summed to the rational function 1/z. So the problem of extending 
the definition of f, outside D, can be solved by declaring that f(z) 
shall be 1/z for all z except the singularity z = 0. But if the series for 
fo had been one without a simple formula for its sum, we could have 
taken _@- 2" 


qrti 


to, e- 7 


5 a +... 


ict 


i,Z) = 


with D, the disc mee < |a|, that is to say the interior of the circle, 
centre a, through the origin. 

If a is not real, D, contains points outside Dy and f, = fo in 
Dyn Dy. 


a 
er. aa 
oo ee i i. 
= -' 3(1- a, q- a a 2.4.6... 2n ee 


D, is the disc |1—z| < 1. 
Again we have chosen a series whose sum is an elementary function, 
to enable us to visualize more clearly the process of continuation. 
In this example let us write g(z) = ./z if, in some specified domain, 


a 7 the fe) = 0-2} = 


the principal value of /z in D, (see exercise 12(a), 1). @ BY. 

Now construct a chain of function elements. We can take, for 
instance, the region D,, to be the interior of the circle with centre 
exp inmi and radius 1. Each D,, overlaps the preceding one, and D, 
is the same region as Dy. As we take a sequence of values of z (say, 
lying on the circle |z| = 1) in the regions common to the pairs 
D,, D1; D,, Dz; ...; Ds, Do, then ./z, starting at z = 1 with the value 
1 will return to z = 1 with the value —1. 
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So the function elements (fo, Do) and (fg, Dg) are different although 
the regions D, and D, are the same. If we pursue the process of 
continuation through Dy, ... to Dg, making a second circumnaviga- 
tion of the origin, the function element (f,,, D,,) taking the value + 1 
at z = 1 is the same as (fo, D,). 


The only restriction so far laid on the region D in which a function 
element is defined is convexity. In writing (f, D), where D is a disc, 
we do not suppose, unless we say so explicitly, that D is the largest 
disc in which fis regular. In particular, if fis defined by a power series, 
D may be its disc of convergence or any disc (not necessarily con- 
centric) inside it. 

We go on to prove a simple theorem (13.3) bearing on the question 
whether (f, D) possesses direct analytic continuations to points 
outside D. We shall then give examples ot Tanction elements which 
have no such continuations. 


Theorem 13.3. Let D be the disc of convergence of a power series with 
sum f. Then there is at least one point b on the frontier of D such that 
(f,; D) has no direct analytic continuation (f,, D,) for. which b € Dy. 


Proof. Let D be the disc B(a; R). We have to prove that there is a 
b with |b—a| = R for which there is no disc D,, with centre b, in 
which a function element (f,, D,) continues (f, D). Suppose, on the 
contrary, that there is no such b. Then, by the Heine—Borel theorem 
3.7, there are n elements (/;, D;) each continuing (f, D) such 


n 17 
that U D, contains every point of |z—a| = R. The set U D, covers 
1 1 


an annulus R, < |z—al| < R,, where R, < R < R,. We have to 
show that f(z) = f(z) for z¢ D;n D;. This follows from theorem 
12.13, since, if D;n D; + 2, then D;n D;n D + @ and 


F(Z) = F(Z) = F{2) 
for ze D;n D;n D. The functions f, thus define a function regular 
in the disc B(a; R,), and so the power series for f converges in this 
disc. This contradicts the assumption that R was the radius of con- 
vergence of f. | 


In examples | and 2 above there is no direct analytic continuation 
into any region containing z = 0. In 1 this point is a pole of the 
function represented by the power series, in 2 a branch point (see 
§ 10.2). 
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Example 3. The power series 
f(z) = 1424+22+28+...42™4... 

cannot be continued to any point outside its circle of convergence 
ae i, 

Proof. Let p/q be a rational number. 7 

Let z = rexp (27 p/q), where r takes values tending to 1 from 
below. | 


Then f(2) = 5 4 ZnS 
q 


The second sum on the right tends to as r tends to 1, and so | f(z) | 
tends to 00 as z tends to exp (27 p/q) along the radius. 

Let D be a region containing points of B(O; 1) and of its comple- 
ment. Then D contains points exp (27i p/q). Any function (g, D) 
which coincides with f in Dn B(O;1) is discontinuous at these 
points exp (27i p/q). So there can be no continuation of f outside the 
circle. | 


In example 3, the circle |z| = 1 is a natural frontier of f. 


13.4. Analytic functions 


In attaching the word regular to a differentiable function (§10.2) 
we said that the term analytic function was being reserved for a wider 
meaning, which we now explain. 

The relation of being analytic continuations of each other is an 
equivalence relation between function elements. This fact serves to 
define an analytic function. 


Definition. An analytic function is an equivalence class of function 
elements. 

If, for each value of z for which it is defined, an analytic function 
has one value only, it may be called one-valued or uniform. 

If there are values of z for which function elements have different 
values, the analytic function is multiform (or many-valued). 

Analytic continuation along a curve. Let y be a curve given by 
z = z(t) for0 < t < 1. Let (fo, D,) be a function element defined in 
a disc D, containing z(0). Let 


0 = lo < ty eg Laid = | 


be a dissection of [0, 1]. Suppose that there is a chain of function 
elements (f;,, D,) such that the arc z = z(t) for t, < t < ty,4, lies in 
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the disc D,(k = 0, 1, ... n). We say that the function element (f,, D,,) 
is obtained from (fo, Dy) by analytic continuation along the curve y. 

To justify this definition we have to prove that such analytic con- 
tinuation is unique in the sense of the next theorem. 


Theorem 13.41. If (fos Do); ec fn: D,) and (Po, A), Bo (Ps A) are 
chains of function elements along a given curve and f(z) = $,(z) for 
zED,nAg, then f,(z) = ¢,,(z) for z€ D, NA, 


Proof. Suppose that the dissections 
O = ft, < Fy os << Ent = 1 
and Of TS 7) CS geet 


are such that the arcy[t,,t,,,]) ¢- D,(@ <i<n) and the arc 
YIT;. T3411 < A; <7 < m). 

The theorem is true if we prove that if the intervals [t,, t;,,] and 
[7;, 7341] have a common point then f,(z) = $,(z) for ze D,n Aj. 
Suppose that there are values of i, 7 for which f; = ¢; in the non- 
empty region D; n A;. Take those for which i+/ is least. Suppose that 
t; > 7; Then i > 1 and the point z(¢,) lies in D;_,n D;n A;. Since 
i+j is minimal, it follows that f/;_, = ¢,; in D;_, Ay. But f,_, =f; in 
D;-1 9 D,;. 

Therefore f; = ¢,; in the non-empty region D;_,n D;nA,. But f; 
and ¢, are regular in the region D;n A,. By theorem 12.13 f;, = 9; 
in D,n A;. This contradicts the definition of i,j. | 


Continuation along homotopic curves. Suppose that analytic con- 
tinuation of a function element (fp, Dy), where Zz) € Do, is possible 
along two different curves y, and y joining Zz, to a point z. We enquire 
whether both curves lead to the same function element (f, D), where 
zé D. The nature of analytic continuation as a procedure based on 
overlapping regions suggests the likelihood of the two curves leading 
to a common function element if one curve can be continuously 
deformed into the other, that is to say, if y, and y are homotopic with 
the fixed end points Zo, z (as defined in §11.6). 

We first deal with curves in a disc. 


Theorem 13.42. Let A be a disc containing D,. Suppose that (fo, Do) 
can be continued along every curve in A. Then the analytic function thus 
defined is one-valued in A. 

Proof. Let ¢ be the centre of A. Let z, be a point of D,. Continua- 
tion along the straight segment [z,, ¢] leads to a function element 
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(f, D) where €€ D. Take a disc D,, with centre ¢, contained in D. 
Then f may be supposed to be represented by its Taylor series about ¢. 

By hypothesis we may pursue the continuation of (f/, D,) along 
any radius of A. By theorem 13.3, the Taylor series converges at 
all points of A. | 


The extension from a disc to a more general region can now be 
made as in Cauchy’s theorem. 


Theorem 13.43. (Monodromy.) Suppose that, in a region D, two 
curves Y», y are homotopic with fixed end points zy, z and that (fo, Do), 
where z,€ Dy, can be continued along all curves in D. Then continua- 
tions along y, and y lead to a common function element in a region 
containing Z. | 


Proof. We operate with the discs B,; and curves y;;, 4;; defined in 
theorem 11.61. By theorem 13.42, continuation along the curve y;; 
is equivalent to continuation along 9;; ¥;,j41. —M41,; in order. Fix 
jand let i take the values of the t-dissection. Then continuation along 
y; and y;,, lead to a common function element in a region containing 
the final end point z. This is then true for y, and y. | 


The most useful special case of theorem 13.42 is the following. 


Corollary. If D is simply connected and (fo, Dy) can be continued 
along all curves in D, the analytic function thus defined is one-valued 
in D. 

Exercises 13(c) 
1. Which of the following functions on —1 < x < 1 have regular extensions to 


regions in Z; : 
: : vx?, sinx, tan47x, cos |x|? 


2. Prove that, if fy= a Gazi <1); 
then f(z) = z+f(z?). 


Deduce that f cannot be continued over the unit circle. 


3. Assume that, if x > 0, then 


[, * ee dt = 4y(n/2) 


(proved in § 8.6). 
Investigate the extension to z = x+iy, and by letting x > 0+, obtain the 
formula, with y > 0, 


ie cos yf?dt = 4/(7/2y).  - 
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Cc 

4. Prove that, if a, > 0, and f(z) = >) a,z” has radius of convergence 1, then f 
0 

cannot be continued over the point 1. 


5. Prove that an analytic function can take at most a countable set of values at 
a given point. Mention two functions for which the set of values is infinite. 


ie.8) 
6. Prove that the Riemann zeta function defined by €(z) = >) n-* is regular if 
1 


x > 1, and ¢(z) defined by >) (— 1)""1n-* is regular if x > 0 (n* having its principal 
1 


value). 

Prove that the function ¢(z)/(1—2'-*) provides the analytic continuation of 
¢(z) into the punctured region x > 0, z + 1. Prove that ¢(z) has a pole at z = 1 
with residue 1. 


7. The function fis regular in a region D which contains the segment 0 < x < 1, 
y = 0. f(z) is real valued on the segment0<a<x<b<1i,y=0. 

Prove that the coefficients of the Taylor expansion of f about the origin are 
real. 

Show that this conclusion may be false if it is only known that D contains the 
point z = 0 and the segmenta <x <b,y=0. 


NOTES ON CHAPTER 13 


§ 13.2. It is interesting to compare this section with §7.5 on mappings from 
R" to R". Since regular functions are much more specialised than general 
differentiable mappings, the hypotheses in theorems can be less restrictive. For 
instance, the corollaries to theorem 13.21 hold without the assumption that 
f(z) + 0; to prove theorem 7.52 we have to suppose that the Jacobian is not 0. 


§ 13.4. The Riemann surface. Consider example 2 of § 13.3 which illustrates a 
selection of the function elements which generate a multiform analytic function. 
To visualize the mapping which it effects, it is not adequate to set up a single 
plane Z as the carrier of the variable z. Rather, we need two planes Z, one super- 
posed on the other and having connecting bridges. The reader should compare 
the present section with §10.2 in which the two branches of ./z were kept 
separate by the veto on analytic continuation across a fixed ray from the origin 
(the negative real axis). 

We will now form a picture of the analytic function defined by w? = z. 
Corresponding to any value a of z, there are two function elements (the two 
square roots of z) each of which is expressible as a power series in z— a conver- 
gent in a disc with centre a and radius |a|. Imagine these discs made of paper. 
The value of a function element at a given value of z may be thought of as being 
marked on the paper. When two function elements are direct analytic continua- 
tions of each other, we suppose their discs to be glued together. In example 2, 
the disc for Dg will not be glued to Do, but Dj, is to coalesce with Dy. A model 
made of paper runs into difficulties because surfaces have to pass through one 
another. There is no trouble about this in an abstract definition of the surfaces. 

So, for the ‘two-valued’ function z*, we picture two planes, Z, being super- 
posed on Z, and joined along the negative real axis in such a manner that a point 
in Z, describing (say) the circle |z] = 1 anticlockwise from the point 1 steps up 
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into Z, on reaching the value z = —1 and, after going again round the circle 
\z| = 1, steps down into Z, on reaching z = —1. The purpose of this imagery 
is to design a structure, a Riemann surface, carrying the variable z on which the 
analytic function w is a continuous function of z. 

The model would be more informative if we include z = 0 and take extended 
planes Z, and Z,. The points (branch-points) z = 0 and z = 0 lie in both planes 
and the planes have bridges leading from each to the other along some specified 
line (say the negative real axis) joining 0 and oo. Instead of extended planes, the 
spherical representation could be used. 


14 
APPLICATIONS TO SPECIAL FUNCTIONS 


14.1. Evaluation of real integrals by residues 


In this chapter we apply to special functions the methods and 
theorems of complex analysis (mainly to be found in chapter 12). 

The theorem of residues 12.5 can be used to calculate real definite 
integrals of various types including many which have no indefinite 
integral expressible by standard functions. 

We shall take four examples. The first two are of the form 


: r(x) cos (or sin) mxdx, 
0 


where r(x) is a rational function. 

The following remark enables us to write down at sight the residue 
of a function at a pole of the first order. 

If fig has a simple pole at z = a, with g(a) = 0, f(a) + 0, its 
residue there is f(a)/g'(a). 


| F(z) _ a+ 
Proof. woes o 
where /; is regular at a. We then have for the residue 
flz\(e-a) _ fla) 
g(z)—g(a)g‘(a) 
_ The remark applies only to a pole of the first order. 


+fi(z), 


25 = lim. 


Example 1. 
| ae where m>0,a> 0. 
o AA 
exp miz 
Solution. Let f(z) = ae 


Let? = | f(z) dz, where y is the path from —R to R of the real 
Fd 


axis, and R > a. Let J -{ J(z) dz, where I’ is the semicircle |z| = R 
& 
0 < phz < 7a. Then , 


ie ee cos mx +i sin mx , ee. “COS MX 5 
x24 age“ 
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The integrand fis regular for all z except simple poles at ia and —ia. 
The circuit y +I is seen to be homotopic to a point, and its index is 
1 for ia and is 0 for —ia. 

The remark just preceding this example shows that the residue of f 
at ia is exp (—ma)/2ia. 

The residue theorem 12.5 then gives 


I+J = (m/a) exp (— ma). 


The modulus of the integrand in J is bounded by 1/(R?—a?), and so 
\J| < 7R/(R?—a?), and J +0 as Ro. Collecting our results, we 
have proved that the integral asked for is (77/2a) exp (— ma). 

Notes. (1) The essence of the proof that J > 0 could be written 


J = O(1/R®2)O(R) = O(1/R) > 0. 


(2) Where is the hypothesis m > 0 used? What is the value of the 
integral if m < 0? 

(3) In setting up the complex integrand f(z), could we not replace 
cos mx by cos mz instead of bringing in the exponential function? 


{= sin x , 


Preliminary survey. If we follow the method which was successful 
in example 1, the best upper bound that we can offer for the integrand 
on the large semicircle is O(1/R) and for the integral the bound O(1). 
But we need the closer estimate o(1) for the integral. This calls for 


Example 2. Prove that 


an argument more delicate than the boundedness of | fi by the 
x 


maximum of | f| multiplied by the length of I’. The extra delicacy is 
a reflection of the divergence of the integral 


ics 
| |sin x| me 
0 x 


The following lemma supplies the necessary refinement. 


Lemma. (Jordan’s lemma.) Let q(z) be continuous for |z| > c, 
0 < phz < 7; and let Mp be sup |q(z)| on I, the semicircle |z| = R 
0 < phz < az. Let m > 0. Then, as R> ©, 


[42 exp mizdz = O(Mp). 
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Proof. Let [ = | q(z) exp mizdz, Then 
3 


\Z| = 


| "q(R exp i0) exp {miR(cos 0+ sin 0)}iR exp i0d0 
0 
< Mp |" exp (—mR sin 0) Rdé 

0 


dn 
= 2M | exp (— mR sin 0) Rd. 
0 
But sin 0/0 decreases in 0 < 6 < 4m and so sin @ > 26/7. Hence 


$n 
[Z| < 2M | exp (—2mR6/m) Rdé 
0 


4 
= 2M x | —exp (—2mRO ln) < ae. | 


Corollary. If, as |R| > «©, q(Re*’) +0 uniformly for 0 < @ < 1, then, 

as R> ©, 

| q(z) exp mizdz > 0. 
Tt 


Another useful lemma is relevant to example 2. 


Lemma. If f has a simple pole at z = a, with residue b, and y is the 

path z = a+de where a < @ < f, then the limit as 6 +0 of | S48 

ib(B —<). ; 
Proof. f(z) = — ?_+2(2), 

where |g(z)| < M. 


B 
— 
a Y 


yZ—a 
Solution of example 2. Let f(z) = Es 


let j= | f, where y is the semicircle z = de (0 < 6 < n), 
% 


i= | f, where I is the semicircle z = Re“ (0 < 6 < 7n), 
T 


—6é R e 
and let I = (| +{) SPE a, 
—R 8 x 
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By Cauchy’s theorem 11.63, 
I-j+J =0, 
the circuit being homotopic to a point and having index 0 for z = 0 
the only singularity of f. 
As 6 +0, the last lemma shows that j > 77. 
As R > oo, J > 0 by Jordan’s lemma. 


As o> 0 and R> 0, 1->2i | * dx 
0 


This completes the evaluation of the integral by the residue 
theorem. There are several alternative approaches, of which two 
have been given in §6.9 and §8.7; and we have referred at the end 
of chapter 8 to Hardy’s assessment of the various methods. 


eee where 0 <a <2. 
—o (e* + 1)? 

This example is chosen to illustrate (i) technique for a pole of 
order higher than the first, (ii) exploitation of periodicity in (part of) 
the integrand. 

Solution. Put ea 


f(z) = (e+ 12" 


Example 3. f° 


Integrate anticlockwise round the rectangle whose corners are 
+X, +X+271. 

f(x+2mi) = e7* f(x), so the integrals along the two horizontal 
sides combine to give 


: 5. 
(1 — e207’) | Sede. 


On the right vertical side |f(z)| = O(e¢-®*) and on the left 
vertical side | f(z)| = O(e-2*). As X > ©, the integrals along both 
the sides tend to 0. 

The singularities of f are double poles where z is an odd multiple 
of zi. The rectangular circuit has index 1 for mi and 0 for all others. 

Put z = m7i+¢ and expand / in its Laurent series in ¢. The residue 
is the coefficient of €-! and that is all that we need to know. 

We have, using a special case of theorem 5.44, 


frit D = e(14ab4+...)(-6-42-...)7 


= eam (at*e +.) 


Thus the residue is eani(g — 1). 
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Letting X —- oo, we have 
(1 — e227") { t(x)dx = 2nie(a-1) 


m(1 —a) 
sin a7 © 


as ieee | Feds 


Ifa = 1, the last line but one says 0 = 0. The integral is then element- 
ary, by the substitution e* = u, and its value is 1, which is in fact 
the limit of 7(1 —a)/sin az. 


x 
Example 4. { Pei —__ dx, where 0 < P< l. 


Observe that the conditions p > 0 and p < 1 are necessary and 
sufficient for the existence of the integral at 0 and oo respectively. 


The general type, of which this is a specimen is | x?—1q(x)dx, where 
0 


g(z) has no singularity on the positive real axis x 2 0. 

The feature of this example is that z?—1 is multiform, z = 0 being 
the branch point. To avoid ambiguity we must specify a branch of any 
multiform function and keep to it. 

We shall deal with this example 
exhaustively—by three different 
methods. 

Method (i). In the diagram the 
large and small circles have radii 
R, 6. The angle BOC is a, where 
0 < 2 <2. 

Let I’, be the circuit composed 
of AB, arc BC (anticlockwise), 
CD, arc DA (clockwise). 

Let I’, be the ree DC, CB (anticlockwise), BA, AD (clockwise). 


Define FZ) = or eq 


is a simply connected region containing [', and excluding the origin, 
in which f, is regular. By theorem 11.62, 


We now need a function fy 2 a —— containing I’, and agreeing 


with f, along CD. Define f,(z) = 


where z?~! has its principal value. Then there 


eo With ze) = pP-! exp (p — 1) Gi, 
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where z = re’? and (say) «—y < 0 < 27+7 (taking 0 < 7 < 4). 
The function f; is regular except for a simple pole at z = —1, where 
the residue is exp (p—1)zi. By theorem 12.5, 


fo = 2niexp(p—I1)z7i 
r, 


I 


— 27i exp pri. 


Observe then that, on BA, 


f(x) = we exp 2(p—1)zi. 


Take ; =e +| fe 


The integrals along CD, DC cancel. The integrals along the arcs 
of the outer circle are O(R®~) and along the small circle O(0”). 
Letting 6 > 0 and R > ©, we have 


=< Pe pt ee 
(1 — ent | 5 dx = —2mie?™ 
0 
: adil 7 
and hence { —. dx = —=——-.. 
o x+1 sin pa 


Method (ii). By the change of variable x = u? we have 


oa) y2P-l 
2 J Shoe 
The reader can verify that he can integrate the function z?”~"/(z? + 1), 
where z*?~! is suitably defined, round the circuit composed of 
—~R <x < —6,6< x < Rand the two semicircles with radu R, 6 
in the upper half plane. The residue at the pole z = is 
(1/2i) exp (2p — 1) 477, 

which is —4 exp pz7zi. 

Method aS The substitution x = e” gets rid of the multiformity 
of the integrand and yields an integral of the type of example 3. 


Exercises 14 (a) 


i dy ste F 
ite eS 


(Choose a circuit for your complex integral which includes one pole, not four.) 


1. Prove that 
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ae 
i, = [, oe (jt-s222, 3;.2.2): 


2. Evaluate 


Deduce that [,, > 1 as n > 00. Also prove this from the definition of J,, without 
calculating its value. 
3. Prove that, ifa>0,b > 0,a+b, 
| - dx m(2a+ b) 


oo (xP +a)22+52%)  2a*b(a+b)?" 
Is the formula valid fora = b > 0? 
4. Prove that, ifa>0,k > 0, 
= "sin Fy 
Se ee — —ka 
‘ caer dx = 47e-** cos ka. 


What is the value of the integral for other real values of k? 


5. In example 2 of §14.1, prove that, instead of using Jordan’s lemma for the 
semicircle z = Re? (0 < 0 < 7) we may integrate by parts along it. Also verify 
that the semicircle may be replaced by other paths receding to infinity, such as 
the rectangular path formed with y = 0 by the lines x = + R,y = R. 


6. Prove that, as X > 0, 


— ax—cos bx 


0 x? 2 


dx = 4nm(b—a)+O (x) (a >0,b6 > 0). 


7. Prove that, ifm > 0,5 > 0, 


i © (x?—q?) sin mx 1 
0 


d. 


ree ee ei =a 2 2) p—mdb __ 72 
x(2 + B) x ape 4 + b*)e a*}. 


8. By integrating (1 —e-*)/z round a suitable quadrilateral, or otherwise, prove 
that 
B e774? — e-Az A. e7% — e—Bz 
ay ee 
b Zz a zZ 


where a, A, b, B are complex, and the paths of integration are linear segments. 
Prove that, if a > 0,c > 0, 


© e-"_ cos Cx 
0 x 


c “sim cx 1 
dx = log-, i ax ==. 
a 0 = 


9. By integrating a suitable function round the unit circle, prove that 
{ 27 sin? 6 d0 


c i-memd+e ". eee 


Evaluate the integral for other real values of k. 


10. Evaluate (7 being a positive integer) 


27 27 
i e©89 cos (nd — sin 0) dO, | e°89 sin (nO —sin 0) dd. 
0 0 
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11. Prove that, if0 < a <1, 


ee) ax 
e 1 
—— dx = ——-. 
pe +1 sin 77a 


Deduce, or prove independently, that, if -1 < 5 < 1, 


© cosh bx ; 
I; mee dx ='47 sec 47, 


12. Prove that, if0 < a <1, 
fo @) e™* 
(P) | — dx = -—7cot7a, 
where the principal value (P) denotes 
—6é co 
lim {| + [ . 
é—0 —- 2 é 
Prove that, if —7 < c < 7, 


© sinh cx 
ie a ee dx = 4tan 4c. 


13. Prove that, if -1 <a<1l,a+0,0<9@< a], 


{ me x e 7m sinad 
ani PSA ie aS 54 = ares ° ° 
9 14+2x cos 6+x? sin 0 sin a7 


(Euler.) 


14. By integrating (log z)?/(1+2z?) round a suitable circuit, or otherwise, prove 


that 
else ze © (log x)” a 
{, ae ———,dx =0 and I wi dx = 3° 


15. If f(z) = exp 7iz?/sin 7z, prove that 
f(2)—f(z—-1) = 2iexp {7iz(z— 1)}. 


By integrating f round the parallelogram with vertices +4 + Rexp iri, where 
R is large, show that 


{ exp (—7?t*)dt = 1. 


(It was long supposed that complex integration did not help in evaluating 


| e~ dx. 


This integral has already been calculated on p. 285.) 


16. Integrate exp (— z?) round a sector of a circle to prove that 


cos x*dx = [ sin x2dx = =. 
j 0 . 2./2 


1 ay SS Baacas | 
Ded ee FEF ; 
educe that uP J (7) a = cos (or sin) uxdu 
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Exercises 17, 18 concern Fourier transforms (§9.9). 
co 
17. If d(x) = i exp (—4u?) cos uxdu, prove that ¢’(x) = —x¢@(x). 
0 
Deduce that exp (— 4x?) is its own cosine transform. 
Prove that x exp (— 4x’) is its own sine transform. 
18. Prove that sech x,/(47) is its own Fourier cosine transform, and that 
1 1 


exp {xy(2m)}-1  xV/(27) 
is its own sine transform. 


19. The Laplace transform. Suppose that 
Fe). = i: e~*! f(t) dt, 
0 


the integral converging absolutely if rez > c, where c > 0. Suppose that f 
satisfies sufficient conditions for the convergence at ¢ of its Fourier series. Using 
theorem 9.95, prove that, if a > c, 


at+tU 
H{f(t+)+f(t—)} = a lim e*'F(z) dz, 


lL U—-aw Ja-iU 


(integrating along the linear segment (a—iU, a+iU)). 


14.2. Summation of series by residues 
The theorem of residues 12.5 can often be used to sum series. To 
illustrate the method we shall prove that 
See ee 
n—-aeo(z—n)* — sin® 12’ 


. where z is any number not an integer. 

The gist of the method is that the function 7 cot 7€ has at each 
_ integer value of ¢ a simple pole with residue 1. Therefore the desired 
sum is equal to the sum of the residues of the function of € 


7 cot 7¢ 


AO) es 


at its poles other than the pole ¢ = z. 
Integrate f(¢) round the boundary @C,, of the square C,, with 
corners at (n+4)(+1+i). We shall prove in a moment that, as 


n —> 0, | f(€)d& +0; assume that this has been done. Then, 
Cn 


putting ¢ = z+a, we have 


1 = Nz 7" cosec? 7 WE 


IZtG)i mas on + 9(@), 
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where ¢(«) is regular at « = 0. So the residue of f(€) at ¢ = z is 
— 7m cosec? 7z. The sum of the residues at € = n for all integers and 
€ = z being zero, we have the required sum. 
(What this argument proves is that the principal value 
i N 1 772 
1 Seer 
i pe eo (z—n)? nj? sin? az 
But, since the sums s and > separately converge, the principal value 
0 — 
is otiose.) 
ft hos still tobe praveucthar | Fey de 0: 
OCn 
By a short calculation, 
cosh 2y +cos 2x 


RIT = och Fyne or 


On the vertical sides of @C,, if 7f = x+iy, then cos 2x = —1 and 
so |cot 7¢| < 1. On the horizontal sides sup |cot 7¢| tends to | as 
n> oo and so |cot 7£| < 2 (say). The length of 0C,, is 4(2n+ 1) and 


therefore 
[L048 < Gry 


This is a close estimate, but an O-argument would suffice. | 


4(2n +1). 


The reader would find it interesting to prove directly 
pe tetes 65 
(z—n)? sin? 2z 
by showing that the difference between the left- and right-hand sides 
(i) is regular except at the values z = n, 
(ii) is bounded as z > n, 
(ii) has period 1, 
(iv) tends to 0 as y > o, uniformly for —4 < x 
and then applying Liouville’s theorem 12.22. 


IN 


] 
29 


14.3. Partial fractions of cot z 


A polynomial p(z) having n zeros can be written as the product of 
factors 


A Il =a) 


Also a rational function g(z)/p(z) is expressible in partial fractions 
with the factors of p(z) as denominators. 
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A vital problem of complex function theory is the resolution of a 
regular function which has infinitely many zeros into the product of 
infinitely many factors. There is the associated problem of writing a 
meromorphic function in partial fractions, displaying its poles (in 
general a countable set). Questions of convergence will arise. Trigo- 
nometric functions provide good illustrations of these problems and 
we shall prove the formulae 


; = z 
Sin 7Z = 71Z — 
It ( =) 


m7 cot mz = - s (+7), 
eg) | \esnon 


where the dash after the & means that the term for n = 0 (which is 
undefined) is to be omitted. 

The latter formula will be proved directly. Infinite products have 
not yet been discussed and an investigation including the sine product 
as a particular case will follow in §14.4 and §14.5. 

The series summed in §14.2 may be written 


7 a = ] 
~ sin? ne Be 25) = ap (€—n)?" 

We shall integrate this equation along a path y in the ¢-plane from 
0 to z which avoids the points +1, +2,.... At € = 0 the left-hand 
side, ¢ say, is regular (or the singularity is removable). On the right- 
hand side the nth term is O(1/n?) and the series converges uniformly 
on the path of integration. 

The function ®(€) = 7 cot 7¢—(1/€), if ®(O) is defined to be 0, is 
regular at € = Oand ®'(¢) = 6(€) (¢ + +1, +2,...). Therefore 


[ ae - [ow = o@ 
and we have proved that 


l ee l 
7 cot ge = a (—+-). | 


The right-hand side has nth term O(1/n?) and converges absolutely. 
If we wish we may remove negative indices by combining the 
terms for +n and —a, 


co 
cot 7z = tla a 3a, 
2_y 


but the expression in first-degree partial fractions is more illuminating. 
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There is a — formula for the cosecant 


1 Sop (Le)), 


zn 


1 
Sin 7Z 


Exercises 14(5) 
Obtain the expansions in 1-3. 
2 (—1) (@+4 
. seCZ 2m, (ai peao ot 
= 1 


2. tan = 27 — 
Pan CES oes 


ot ok ea 
pint 2 ¢ eae nn 22+ 4n2n" 


Carry out the summations in 4-7. Observe that, in many such questions, the 
methods of chapter 9 (particularly §9.8) provide alternatives to those of §14.2. 


4 — aoe 1 m+¥v3 sinh 4my/3)_ 
i 64n°—-1 2 12 cosh 47,/3 
5. : rer wots _ ats k?n? — a coth kn (k + 0). 
Ss the" Ze + ap oot (” | ) pce cena 
¢ # ae ya = pr (-7<a<7,x +n). 


Exercises 8, 9 refer to Bernoulli numbers which figure in a number of formulae 
in analysis (e.g., in theorem 14.86). 


8. The Bernoulli numbers B, are definied by 


Prove that 

(i) this series has radius of convergence 277; 
(2, = 1, B, = 0 if nis odd; 

(iii) the sequence of B,, satisfies 


= 2 
-? "Ba, ( SS = 2n+1; 


r=0 2r 
(iv) B, = %, By = —30, By = ¢z, Bs = —35. 
9. Use 3 to prove that Bop 2(2k)! Sox 
(277)** 
where 52; = —— “al? and that, for large k, 
2(2k)! 


| Box| ~~ (27)* e 
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14.4 Infinite products 


The statement in §14.3 of the factors of sin 7z was the first mention 
in this book of an infinite product. We must lay down the necessary 
definitions and prove theorems about convergence or divergence. This 
can be done briefly since no new principle is involved and infinite 
products are closely linked with infinite series. 

Write = 

A, oe i (I +4,). 


Definition. If, as n tends to infinity, A,, tends to a limit A, not 0, we 


say that the product JJ (1+a,) converges and has the value A. 
r=1 


Definition. Suppose that 1+a, = 0 for at least one r < m and for 


c 
nor > m. Then the product {J (1 +4,) is said to converge to zero if and 
r=1 


co 
only if J] (1+4,) converges to a non-zero limit. 


Tr=m 
[o@) 
Definition. If T{ (1+a,) comes under neither of the last two definitions, 
r=1 


it is said to diverge. 
It pays to describe as divergent a product for which A, tends to 0, 


for instance (1-2) (1-2) (1-!) 
5 3) 00 oe 


where A, = a ss = = 3 The reason, which will become plain 


in the theorems that follow, is that the behaviour of the product 
II (1+a,,) is linked with that of the series Xa,,. 


Lemma. Inequalities between products and sums. Let a, > 0 and 
Syn = a,+...+a,. Then 


(1) [1 (1 +a,) > 1+5,. 
1 
If, further, a, < 1 jor1 <r.<.n, 


Q) Ida) > 1-5, 


G) Id-4) < 775. 
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If, further, s, < 1, 


(4) Il (+a,) <7 


nr 


Proof. SoBe gives (1) and (2). To deduce (3) and (4) use 
(1+a)(1—a) < 1. | 


Theorem. 14.41. (1) If a, = 0, then Il (1+a,) and Xa,, both converge 
or both diverge. 

(2) If 0 < a, < 1, then 11 (1—a,) and Xa, both converge or both 
diverge. 


Proof. Use the appropriate inequality from the lemma. For: 
instance, in (1) suppose 2a, convergent. 
Choose m such that 


eg NS ore oe 
By (4) of the lemma, 
(1+@msi1)---1+a,) < 1/(1—?). 


The left-hand side is a bounded increasing function of n. Hence the 
product converges. | 


Theorem 14.42. If a, > 0, then Il (1+a,) is independent of the order 
of the factors. So is Il (1—a,) if0 < a, < 1. 


Proof. Use monotonic sequences as for the corresponding result 
for series (e.g. C1, 94). | 


Theorem 14.43. A necessary condition that Il (1+a,) converges is 
that a, > 90 asn-— oo. 


Proof. 1+a@, = An/An— and A,-1, A, both tend to A (not zero). | 


Definition. (Absolute convergence.) The product II (1+4a,) is said to 
converge absolutely if II (1+ |a,|) converges. 


Theorem 14.44. If 11 (1+a,) is absolutely convergent, then (1) it is 
convergent and (2) its value is independent of the order of its factors. 


Proof. Write |a,| = b,, An = Il (1+a,), B, = ll (1+46,). 
1 1 
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Then 


Ave —1| = ita) ba) 1 
< (+8, (145,0)=1 


aioe, 


B,, 
Also |A,| < B,. Multiplying the two inequalities we have 
Mais a A,,| < By iy oe B,,. 

By hypothesis the B,, form a Cauchy sequence. Hence so do the A,, 
and (1) is proved. 

Let A, > A and B, > B. 

Let II (1+a,) be a rearrangement of II (1+a,) and II (1+6,) the 
same rearrangement of II (1+8,). Let A; and B, be the respective 
products of the first n factors. By theorem 14.42, B’ = lim B,, = B. 

Given n, take g so large that A, contains all the factors in A, (and 
so Bo those in B,). 

Then, if r 2 q, 

|A;—A,| < B;-B, (by the proof of (1)) 
< B’-B, = B-B,, 
and so 
|A—A,| < |A—A,| * i44—4,| < |\A—A, + |B—B,,|. 
Therefore A, > A. | 


In what follows we shall connect the logarithm of a product of 
factors (assumed not zero) with the series of logarithms of the 
separate factors. This is an easy exercise if the factors are real, but 
more delicate if they are complex. Each logarithm will be supposed 
to have its principal value. 


Theorem 14.45. ll (1+a,) converges if and only if, for some m, 
1 
Sy log (1+4,) converges. 
m+ 
Proof. If > log (1+4,,) converges, to sum /,, say, then (since the 
m+1 


exponential function is continuous) [J (1+a,) converges to exp /n, 
m+1 


and Il (l+a,) = 1+a@,) ...(1+a@,) exp I. 
1 
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The converse needs more care because, with principal values, 
log zw need not be equal to log z+log w but may differ from it by 
2mi. However log zw = log z+log w if |phz| < 47 and |phw| < 37. 

Suppose that II (1 +4a,) converges. 

Then, given ¢(< 1), there exists N such that 


Ado AGMA Se 4) 


N+p 
He (i +a,)—1 


for alln > N and all p > 0. 
If |z—1| < € < 1, then (as is clear from a diagram) 


l—e < (2) < 1 +2, (14.42) 

and |phz| < arc sine < 476 < 47. (14.43) 
n+ n+ 

coceike es = oe (14.44) 


or the two sides differ by a multiple of 27i. We prove equality for 
n> Nand p 2 0. 

Let n > N. Then (14.44) holds if p = 0. Suppose that it holds for 
p=4q 2 0. By (14.41) and (14.43), 


n+q 
ph iT 4a)| < ae igh cao ioeae. 


nt+qt1 n+q 
Hence log [I (1+a,) = log II (1+a,)+log(1+4n+qi1) 


n+q+1 
= © Jogi +a). 


Thus, by induction, (14.44) holds for all p > 0. 
Finally, (14.41)-(14.44) show that, for fixed m, the function of n 


defined by ¥ log (1+4,) is a Cauchy sequence. | 


Theorem 14.46. If, for a fixed positive integer k, X|a,|**! converges, 
then IL{(1+a,) exp (—a, +4a%,—...+(—1)*an/k)}} converges 
absolutely. 7 
Proof. The nth factor is 
exp {log (1+a,)—a, +4a2,—...+(— l*an/k} 
exp {O(|a,.|"*)} 
1+ O(|a,|**") 
= 1+5,, 
where = |b,,| converges. | 
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Theorem 14.47. If Xa,, Xa, ... Xa and X |a,|**1 converge, then 
Il (1 +4a,,) converges. 


Proof. As in the last theorem, 
log (1+z) = z—4224+...4+(—1)*-12*/k + O(|z|**4). 


In this write z = @, dg, ... d,, ... and sum. The hypothesis shows 
that = log (1 +a,) converges. Apply theorem 14.45. | 


The reader should be able to define uniformity of convergence of 
products II {1+/,(z)} and to prove theorems which naturally arise, 
such as the analogue of the M-test (theorem 5.32). 


14.5. The factor theorem of Weierstrass. The sine product 


Theorem 14.51. Given a sequence of complex numbers ay, do, ..., 
whose only limit point is 0, there is an entire function with zeros for 
these values of z and no others. (a, is repeated k times if the zero there 


has order k.) 


Proof. Assume for the present that no a,, is 0. 
We may suppose that 


|a,| < |a.| < |as|..., with |a,| + 00. 
If the product ll (1 -2) 
1 n 


were convergent, it would define the entire function sought. In 
general the product does not converge and we have to affix convergence 
factors.to the factors of the product in the manner of theorem 14.46. 
For shortness write E(u, 0) = 1—u, and 


E(u, p) = (1—u) exp (ut+ 40? +...+u?/p). 


It is always possible to find a sequence p, to make = |z/a,|?-+! 
converge for all values of z. In fact p,, = n would do because, given 
z, if n is large enough, |a,| > 2|z| and |z/a,|" < 1/2”. 

The argument of theorem 14.46 proves that, for a fixed R, there is 
a constant A such that 

|E(z/ay, Pn) = | S A |z/a,,|Patt, 
for |z|<R and all n. 


Then f(z) = I E(z/a,, P,) converges for all z, and the convergence 
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is uniform for |z| < R, however large R is. Theorem 13.11, shows 
that f(z) is an entire function. It has zeros at a, and nowhere else. 

At the beginning of the proof we ruled out the possibility a, = 0. 
If the function is to have a zero of order k for z = 0, then we adjoin 
a factor z*. | 


Notes. (1) The choice p, =n 1s extravagantly large. In many 
common applications p, can be taken to be a suitable constant (often 
1 or 2). 

(2) If f and f, both satisfy the requirements of the theorem, then 
A. /f, having removable singularities at the common zeros of f and /,, 
is an entire function, say g, without zeros. The function g must be 
of the form exp h(z), where A(z) is an entire function. For g’(z)/g(z), 
being entire, is the derivative of an entire function A(z). Then 


£ (ete) exp (—A(z))} = exp (—A(2)){g'(2)—-g(z)h'(2)} = 0 
and hence g(z) is A exp A(z). 


oe) 2 
Theorem 14.52. sinnz = nz TI (1 -7). 
n=1 
Proof. In theorem 14.51, a, = +n, and 2% (1/n?**) converges if 
p = 1. So we have 


“gs eed = 
sin 7Z = g(z)7z I (1 “| exp ‘I, 


where the dash in [[’ (or in - denotes that the term with n = Ois 


= 59 


omitted. Here g(z) is an entire function without zeros, to be found. 


Writing Sy(Z) = g(z)7z ng (1 -?) exp ‘I 


ae 1 
we 40 BBP le) 
we have Sy(Z) eRe +— +e a. Sy(Z), 
where z is not an integer. 
The proof of theorem 14.51 shows that s,(z) converges uniformly 
to sin 7Z on any compact set. From theorem 13.11, sy(z) converges 
to 7 cos mz. Therefore 


ee | ee 1 gd 
a-cot-nZ.= aC stot (+5) 2 Od 8): 
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Collating this with the formula for cot 7z found in §14.3, we 
deduce that g’(z) = 0, and so g(z) is a constant A. As z>0, 
sin 7z ~ mz and therefore A = I. 

Multiplying the factors containing +m and ~—n, we have the 
result. | 


Exercises 14(c) 
Examine for convergence the products whose th factors are given in 1-6, 


sin (z/n) 


1. 
z/n 


a2+n 
a B+n 
3. cosh =. 
n 
ee 
4. 1+(-—1)" sinh = 


(1 +2")? 
1+22" ° 


‘ 1-(-"2") 


Prove that II (1 . diverges and II | 1+- 


7 converges. 


8. Prove that, if |z| < 1, the infinite product 


(1+z)0 +27) + 24)(14+ 2°)... 
converges to 1/(1—z). 


9. Prove that, if |z| < 1, the product of the three infinite products 


oO ie.8) 


Hd+29, Ha+") He-e4 
1 1 1 
is 1. 
10. Prove that, if |z| < 1, 


[o.0) - Zz Zz? z3 
Hass ) = exp | ete} 


11. Express cos 77z in factors, on the model of theorem 14.52. 
12. Prove or — the following statements for real @,, by. 
nN fee) 
(i) If lim [[ (+a,) = 1, then [[(1+a,) converges. 


n—>Or=n 1 


co eo 
(ii) If lim (@,/b,) = 1.and J] (1+5,) converges, then [| (1+a,) converges. 
n—>oO 1 1 


(oe) n 
(iii) If 5} a, converges, then [] (1+4,) tends to a finite limit as n > 00, 
1 r=1 
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13. Prove that, if p runs through the sequence of prime numbers 2, 3, 5, 7, 11, 
and rez > 1, then 
ai) [] ( eka. = ——, where the Riemann zeta function {(z) = > —, 
p Pp . C(z ) n=1 n* 
and the zth powers have their principal values. 
Prove further that 
(ii) ¢(z) is never O for x > 1; 


(iii) >) (1/p) diverges. 
p 


14. Suppose that a, > 0. Given a sequence b,, construct an entire function f 
such that f(a,) = bn. 


15. Elliptic functions. In §14.2-14.5 we have used trigonometric functions 
(periodic functions) to illustrate the text. The methods and theorems can be 
applied to construct functions which possess two independent periods and to 
develop their properties. 

Let w,, ©, be numbers such that ph(#,/,) + 0. Write Q,,,, = 2mw,+2nw,, 
where m, n can be any integers (+, — or 0). Let &’ and II’ denote a sum anda 
product over every pair of integers m, n except m = n = 0. From exercise 4(g), 11, 
x’ O-7*, converges absolutely. 


(i). Prove that 
, Zz Zz _ 
o(z) = zil (1-5) exp ee +302 ie} 


is an entire function with a simple zero at each point Quy, n. 


(ii) Prove that s 
wo) te | 1 1 =| 


ie fener Maa OL. 


is regular except for a simple pole at each point ©,,,,. The function o’/a is called 
¢, but it has nothing to do with the zeta function of Riemann in 13. 
(iii) Prove that the function @, defined by 


((z) = —S"(z) = ath! a _ a 


is regular except for a double pole at each point ©, ,. 
In (iv) we shall outline the steps of a proof that 


((z+20,) = (2) 
and §2(z+ 202) = (z), 


that is to say, the function @ is doubly periodic, having 2m, and 2w, as periods. 
We observe that a doubly periodic function which is regular for all finite z is 
bounded and therefore, by Liouville’s theorem, a constant. For_historical 
reasons, a doubly periodic function which is meromorphic in Z is called an 


SO Prove That | = 
iv) Prove that (if z + Qn, n) 


(a) £2 is an even function, 
(6) its derivative 40’ is odd, 
(c) 9’ is an elliptic function, 
(d) §2 is an elliptic function. 
Elliptic functions have a multitude of properties; the prevailing method of 
proof is by Liouville’s theorem. As one illustration we ask the reader in (v) to 


Y 


4 y 
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construct the differential equation for (9(z). For anything more we must refer him 

to books containing systematic accounts. C—O E. 
(v) The differential equation for f(z). 
Establish the Laurent expansion 


§(z) = z-2+ agz?+a3z* + O(|z|), 


with constant term 0, in a disc centre z = 0. Choose coefficients g2, g; so that 


{Q’(z)}? = 4G(z)}>— g2§(z)— 83+ O(|z|*) 


and deduce the differential equation. 


14.6. The gamma function 


The gamma function I'(1 +z) extends the factorial n! to any z, real 
or complex. We acknowledge that Jeffreys and Jeffreys are right in 
contending that the name ‘factorial’ and the symbol ‘!’ ought to 
have been used for the extension and that a new notation was otiose. 
But we lack the courage of their convictions and continue the usage 
established by Euler and Legendre. 

Three definitions are possible, all of which are useful: 


E(Euler) [(1+zZ) = A Ped ez >-=F, 
0 


where /* means exp (z log 2), the principal value; 


n! n? 

G (Gauss) rd +2) = = (z+1)(¢+2)...(Z+n)’ 
where z is not a negative integer and n’ has its principal value; and, 
thirdly, a definition of the reciprocal as a product of the type given 
in theorem 14.46, 

1 _ Zz 
— z ae —2/n 
W (Weierstrass) Td +2) ev J] (1 +=) e , 


n=1 


where y is Euler’s constant (namely lim 1% (1/r)— log n|) ; 
F 1 


It is easy to reconcile the second and third definitions G and W. 
The right-hand side of W is 


lim [exp (1+3+- a logn)} I Il 7 a e-=r| 


os puey: 


= nats ela tay) 


path nt 


which is 1/I'\(1 +z) according to the definition G. 
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Observe that theorem 13.13 shows that Euler’s integral defines 
T(1+z) as a regular function for rez > —1. By theorem 14.51, W 
defines 1/[(1 +z) as a regular function for all z with simple zeros at 
z = —n. So I(1+z) is meromorphic, regular except for simple poles 
at z = —n. Thus G and W will give the analytic continuation of the 
function defined by E into the whole plane Z with the negative 
integers removed. 

We have still to reconcile the Euler and Gauss definitions. Since 
they both define regular functions in their domains of existence it is 
sufficient to prove equality for some interval on the real axis, say 
0 < x < 1. Byn integrations by parts, we find that 


* di n+ n! 
{. , ( -‘) ae (x+1)(x4+2)... (xtn4+1) 


and the right-hand side is the Gauss expression multiplied by the 
factor n/(x+n+1) which has limit 1. So what we need to prove is the 
following theorem about the limit of an integral. 


Theorem 14.61. If 0 < x < 1, then 
lim “# (1-4) dt =(- Feo *dt. 
0 n 


n—>0o 0 


Proof. We show first that, for a fixed positive ¢, 


(sy 
n 

increases with n. 
In the straightfoward inequality (C1, 17, exercise 1 (d), 8) 


1 —q”*} 3 1 — aq” 


a ee (OO <a<l) 
substitute a” = i] We have then 
n+l 
1—- <a! (0 <t <n) 
t n | , n+1 
and hence (1 -) < (1 4) ; 
n n+1 
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Given €, choose m such that 
| t~e-*dt < €. 
m 


In [0, m] the increasing sequence of continuous functions 


te (1-)" 
n 


has the continuous limit t*e-, 
By theorem 5.35 this convergence is uniform and by theorem 5.22 
we can choose m, > m such that, ifn > no, 


0 < < [re —t dt — i (1-")"at <e, 


Hence, for n > no, 


fe f“e-tdt—2e < Ee (1-£)"ae 
0 0 n 
<|"r (1-5) at 
0 n 
< | re-tar, | 
0 


In proving properties of the gamma function we can use whichever 
definition best serves the immediate purpose. The following results 
are basic: 

(1) Td+2) = zF), iD =1, l@+p =n 

(Prove from either ae or G.) 


(2) T@)Td-z) = 
IQ) = 7. 


Proof. Multiplying the two W products we have 


Siti 7Z~ 


rasarace ~ B('-m) 


1 _ sin 7z 
zI(z)T(1—z) — 


For the value z = 4 note that (4) is not negative. 
(3) The duplication formula — 


T(z) l(z+4) = 7#2!-1(2z). 


and so (from theorem 14.52). 
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Proof. From G, the left-hand side is 


nn! n*-*n! 
© az 1)... .(z+n—- it ™ G+hHE+D wee (Z+N—4) 
= n®-$ (n!)222" 
~ (92) (2z+1)(2z +2) ... (2z+2n—1)" 
From the Gauss formula for ['(2z) with 2n in place of n we find 


a (2n)*-(2n) 
T(2z) = lim (2z)(2z+1)(2z+2) ... (2z+2n—1)° 


Hence T(z) l(z+4) = A2!-“1(2z), 


(n I)? Jan 


where Ae im (Qn) ink’ 


The value of this limit is probably known to the reader (A = ,/7, 
Wallis’s formula for 7, e.g. C1, 135); without assuming this know- 
ledge we have only to put z = 1 in the preceding line, obtaining 
T(3) = 4AT (2) and so A = IB). 
(4) The duplication formula (due to Legendre) is the most useful 
case of the general multiplication formula (due to Gauss) 


Fey (: + ‘) re (2 7 = (27)#"-Dyt-rT (nz). 
This may be proved by the same method. 
(5) As an application of gamma functions we use them to evaluate 


the infinite product J] u(n) where u() is a rational function of n. 
n=1 


If the product is to converge, the degree inv of the numerator of 
u(n) must be the same as the degree of the denominator. The method 
will be clear if we take the degree to be 2 and express, in terms of 
gamma functions, the product 


pe % aden) 
~ n=1 (n—b,)(n— by)’ 


where no a or bis a positive integer. 
The nth factor of P is 


a oe b, +0(). 
n n* 
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Hence a necessary and sufficient condition that the product P 


nverges is tha 
i : 8 t a,+a, = b, + be. 


The nth factor can then be written 


(1 ae “1 ea/n (1 oa “:) em/n 
n n 

(12)ovm (1) om 
n nN 


The W definition shows that 


p = Ll-by)TG — 45) 
TG —a) tim) 


Exercises 14(d) 
Find the values of the infinite products whose nth factors are given in 1, 2. 


1.1-—n* (n2=2) fork = 1, 2, 3,4. 
2. Lne*. (n.21) fork = 1,2, 3. 
3. Prove that 


ce See Pe os Pe 
ae — Dati -yy) eo 
4. Prove that [Cd + iy)|? = zy/sinh zy, 


[I'\(4+ iy)|? = 7/cosh my. 
[T'(iy)|? = 7/y sinh my. 
5. Prove that | 
a—2(145) (1-3) (144) ~ = pasadraaasy 
What is the value of the product if the factors are rearranged so that two with 
a — precede each factor with a +? 


6. Find the residues of I'(1 +z) at its poles. 


14.7. Integrals expressed in gamma functions 


Many common definite integrals can be evaluated by gamma 
functions. 
Define the beta function by the equation 


1 
B(p,@) = | -xP-(1 —x)Ads, 


the integral existing if rep > 0, req > O. 
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The following properties (i) to (iv) are elementary. 
(i) B(p,q) = BG, Pp). 

(ii) pB(p,q+1) = qB(p +1, 9). 

(iii) B(p, gq) = B(p+1,q)+ Blip, q+ I). 

(iv) pB(p, q) = (p+q) B(p +1, 9). 


Theorem 14.71. Bp, gq) = Te =~. 


Proof. In order to avoid unbounded integrands we shall suppose 
that rep > 1 and reg > 1. The property (iv) above will then extend 
the theorem to any p, q with positive real part. 


{ ety { e-vyth dy 
0 0 


Pp) Tq) 


: A A 
lim dx | eft) xP— yr-tdy 
0 


A—o, 0 


im || erin dx dy, 
= S(A) 


where S(A) is the square with opposite vertices (0, 0), (A, A). The 

line joining the points (A, 0) and (0, A) divides the square of integra- 

tion into two triangles, the south-west one A(A) having its third vertex 

at (0,0). In A(A) make the transformation x = u, y = v—u (for 

which a(x, y)/@u, v) = 1) and {| becomes an integral over 
A(A) 


0 <u<v< A. Expressed as a repeated integral it is 


A v 
{ dv | e-’yP-l(p — u)2 du. 
0 0 
The substitution u = vt in the inner integral gives 
A 1 A 
{ do { e*ppta-1pp-1(] — t)¢-1dt = Bip, a| e-*pP+4-1 dy, 
0 0 0 
If p and g are real (which is the usual case) it is simplest to use the 


since S(4A) < A(A) < S(A) and the integrand is positive. 
As A > 00, each of the integrals over the squares tends to Pp) Iq), 
and the integral over the triangle to B(p, qg) (p+). 
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If p and q are not real, we cannot assert the preceding inequalities, 
but we shall prove instead that the integral over the north-east 
triangle (A, 0), (A, A), (0, A) tends to 0 as A > oo. 

If p, and q, are the real parts of p and gq, the integrand has modulus 
at most e-4x?:-ly4-1 and the integral has modulus at most 
e~4A”1+41/p,q;. This tends to 0 as A > 00 and the theorem is proved. 


To illustrate theorem 14.71 we add an alternative proof of the 
duplication formula (§14.6, (3)). 
We have, x being real and positive, 


ee ~ | t-(1 — 1). 


Put ¢ = 4(1+4u). The right-hand side is 


1 1 
P sid | (1+u)*“(1 —u)*-1du = 22-2 | (1 —u?)*— du. 
=’ 0 
Now put uw? = v and we have 


1 
oo | (1—v)*"1v-4dv = 2!-2*B(x, 4) 
0 


eee 
I(x +4)° 


This proves the formula for positive x. The left and right sides of 
§14.6 (3) are regular throughout the z-plane except at their poles. 
Theorem 12.13 shows their equality. | 


Theorem 14.72 illustrates the expression of a multiple integral in 
gamma functions. 


Theorem 14.72. (Dirichlet’s integral.) 


PD)T@PO)TE) 


o —lyer-1f/] _ y_ 1) _ »\s—1 a 
[[[ ytt2td —x—y—z)1dxdydz irEbree> @ 


where p, q, r, 8 are greater than 0, and A is the tetrahedron bounded by 
the planes x = 0,y = 0,2 =0,x+y4+z = 1. 


Proof. In order to have a continuous integrand we prove the 
result when p,q, 7,5 are greater than or equal to 1. The extension 
to values between 0 and 1 would require a discussion of multiple 
integrals of unbounded functions. 
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The neatest method is to write 
xty+z=& ytz=&y, z= &n6, 
so that x= 1-7), y = &y(1—9). 


The x, y, z tetrahedron is then seen to be mapped onto the cube in 
which each of £, 7, ¢ is between 0 and 1. 

The Jacobian @(x, y, z)/a(E, 7, $) is £7. 

The integral then becomes 


1 1 
| dé | an | Eptatr—(] — Ep tart] — P11 — fet de, 
0 0 0 


which is the product of three single integrals in &, 7, ¢. 
Apply theorem 14.71 to their product 


B(p+q+tr, s)Bq+r, p) Br, 9): | 


Exercises 14(e) 
1. Ifa>0,b> 0,k > 0, express in I functions 


(i) [, xet(1— xy a , 


fia + x)2-11 — Be ages 
Gi i) 3 a (14 x2)et8 dx 


2. Express in terms of I'(4) 


: iad 
© |) yarrsaray 


= j, Es ipa) & 


3. Prove that, if 0 < a < l, 


foe) 
. y*-! cosydy = I(a) cos 47, 


{ y? sinydy = I'@) sin 47a. 
0 


4. Express the content of the unit ball in R” as 

m/T(4n+1) 
and show that this is the value obtained in exercise 8(/), 12. 
5. Prove that, ifb >a> —1, 


mI\(a+1) 


4a 
. bee bese = ee 
[, pos! UNOPS sata ibe Ge ibs 
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6. Prove that, if rez > 1 and &(z) = }} n~, 
1 


co t?-1 
[, 3a tt = TO. 
14.8. Asymptotic formulae 

The object of this section is to develop methods of approximating 
to (1 +z) for large |z| by formulae suited to numerical computation. 

Stirling’s formula for the factorial 

n! ~ /(27)n™+4e-" 
is likely to be already familiar to the reader. (A proof is outlined in 
C1, 148.) Our first theorem is a refinement of the formula. 
Theorem 14.81. If n is a large positive integer, 
Ee \ ants p—n ( — =) 
n! = ,/(27)n"+te ' + 17, +P (5 


Proof. The argument is elementary and we state it without 
comment. 


1 log xdx —i{log (n—1)+log n} 
a | 


n 


= nlogn—(n—1) log (n—1)—1-—4 log (n—1)—4 logn 


= —(n—4) log (1-) —] 


ee I | 
as eee -Se tae eae as anes 
l | 
Bie ee (3) 


Summing from 2 to n, we have 


"log xdx— > logr+41 Sie 
where A, here and in the following lines, is a constant. It is not 
necessary to put suffixes A,, A,, ... to denote that the constants in 
successive steps may differ. 


The last equation is 
i. 35 | 
1 ome — ! —- — —_ — pa 
(n+4) logn—n-—log (n!) = A 3 atO (-:) : 
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Since DAS < £ < eel em 
(rt+iy 7? r(r—1)’ 
] bile | ] 
we have ea < aA < 2 
and hence > * ss lo (-:). 
atire n? 


Thus we have proved that 


ri a | I 
! coed 1 ae “a sal 
log (n!) = (+4) logn nt Ati +0(4) 


and therefore n! = An"+#e- exp i +0(;3)| 
1 l 
= Anrtte-n = 
An"*¥e oe +0(53 ak 


To find this unknown constant A we substitute the formula we have 
just obtained in Wallis’s formula (C1, 134) 
a5 See 
sce aa 
and we obtain A = (27). (Compare the evaluation of the constant 
in the duplication formula §14.6 (3).) | 


We take up now the more difficult problem of approximating to 
I'(1 +z) when the complex number z has large modulus. On account 
of the poles at negative integer values of z, the approximate formula 
must be restricted to a sector 


—m+0 < phz < 7-06. 


We shall ultimately prove that the formula to be stated as theorem 
14.86 holds in the smaller sector 


—47+6 < phz < 47-4. 


The approximation will take the form of an asymptotic expansion 
(or series) according to Poincaré (1854-1912). The formal definition 
will be easier to grasp in the light of a simple illustration. 


Example. Prove that, for positive x, the function f defined by 
—at 


ase 
y= 1 To 
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satisfies F(X) = Sn(x) +7n(X), 
See as (2n—2)! 
where cine = pete : 5 ani 
2n)! 
and I7n(x)| < oy ° 


Hence approximate to f(x) for large x. 
Solution. Substitute 
oS Bion 1—72+ +(- 1)" 1" + (- 1)” Cae 
L470 get 
in the integral defining f. We have the required s, and also the 


inequalit 
— eee ee 
lrn(X)| = Ne PT ee: ee 


The remarkable property of the decomposition 


F(X) = Sn(X) +7n) 

is that, as n> oo, the sequence s,(x) diverges for every x. The 
remainder r,,(x) is, however, of the order of magnitude of the (n + 1)th 
term of the expansion (in this example, numerically less than it). If 
x is large, the terms of s,(x) initially decrease in magnitude and the 
value of f(x) can be computed with high accuracy by stopping at the 
(2n)!: 
xentl 

We now define formally an asymptotic expansion (or series), and 
in theorems 14.82 and 14.83 we establish properties that we shall need. 


with smallest value. 


term 


Definition. Let s,(z) be the sum of the first (n+ 1) terms of the series 
1, {an 
| Ag tT teeta tee (14.81) 


Let r,(z) = f(z)—s,(z). Then the series (14.81) is called an asymptotic 
expansion of f(z) for a < Phz < P if, for each fixed n, 
lim z"r,(z) = 0, uniformly in Phz. 
|z|—>00 
This definition applies to a power series in 1/z which converges 
for sufficiently large |z|, say for |z| > R. For then there is a constant 
M, depending on R only, such that for all values of Phz 


MR*+1 
a2) < (Ray 
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The notation f(z) ~ /a,z~” is commonly used for an asymptotic 
0 
expansion. As the symbol ~ has been given a different meaning in 


§4.1, we shall refrain from using it now. 


Theorem 14.82. (Uniqueness.) (a) For a given interval of Phz a 
function cannot have more than one asymptotic expansion. 

(b) A series can be the asymptotic expansion of more than one 
function. 


Proof. (a) Suppose that, for « < Phz < f, 2a,z™ and &b,2™ 
are both asymptotic expansions of f(z). Then, for fixed n, as |z| > 00, 


(Ao =, bjs" + (a,— bz = eee a (An EPS, 1) — 0. 
It follows that ay = bo, a, = by, ... An = 5» (for every n). 
(b) The reader can verify that an asymptotic expansion of f(z) for 
—in < phz < 47 is also an asymptotic expansion of f(z) +e. | 
Theorem 14.83. (Integration term by term.) If, fora < Phz < £, 


ay ay 
a eee +... 
ot St. ton 


is the asymptotic expansion of a regular function f(z), then 


is the asymptotic expansion of { | fo) —a~ 4 db, where y is a 
= 


suitable path from z to © within the sector a < Phz < B. 


Proof. Write f(z) = folz)—ay- 2 
and 52) = G+..+2, f@ = 2 +r) 


From the definition of an asymptotic expansion, given e, there is 
Z, such that 
[f(z)—s,(z)| < e|z| for |z| > [zo], % < Phz < £. 


Let 0 be fixed in « < 0 < f. Let y, be the linear segment ¢ = pe” 
from z to z’ where r = |z| > |Z. 
Let y be the infinite extension of y,, when r’ = |z’| > ©. 
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For Cony, 
I(Q)—-s,(E) = O(/E|-”), where n > 2, 
and S(¢) = O(|¢|-*). 


Therefore | {f(Q)-—s,(O}de and i s.(G)adC: exist, and hente 
fa Y 


| S()dé also exists. Moreover 
Y 


[ 1O-sohat 


dp _ a 
. “|S ~ (n—1) |Z)" 


This is the condition for 


a 


(n—1)z"-1 a. 


to be the asymptotic expansion of S(S)d€. | 
$2 


We need a number of properties of the gamma function. 


Theorem .+ 4. If F(z) = Tr af 


where z is not a negative integer, then 


(l) A) = -+ 3 (F-—), 


n 29H 


I 
+ ea 


(2) F(z) = 


(yf @) -|'2 ae - dt (rez > —1). 


Proof. (1) In the W definition of 1/I'(1+z) the infinite product 
converges uniformly on any compact set not containing any of the 
points —1, —2,... (by the analogue of the M-test, since the factors 
are 1+ O(1/n?)). Writing 


s{2) =e" I | (1 +?) el, 


we have, by theorem 13.11, 
Et 1 1425 = Fiz): 


Sn(Z) > miviec) oe res 
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that is, uO > F(z). 
But Si = ty + By (=; _ *) Sn(Z)s 


and we have proved (1). 

(2) We again appeal to theorem 13.11. 

(3) The reader will have observed that the series & (z+n)~? is of 
the type discussed in §14.2. We could find a formula for the sum by 
integrating (z+ ¢)-? 7 cot 7€ round a sequence of suitable circuits C,, 
enclosing the points ¢ = 1,2,...m. We can however obtain the 
integral in (3) much more simply as follows. By theorems 13.11 and 
13.13 the series and the integral on the right-hand sides of (2) and 
(3) are regular if re z > —1. If we prove them equal for real z (say x), 
then theorem 12.13 shows that they are equal for rez > —1. 

Write f,(t) = te**+™. Then f,(4) 2 0 for ¢ > 0, and 


th) = 
LAO = 3-4 
Since this sum is continuous, by theorem 5.35 the convergence of 


> f,(0) is uniform in [8, T] for 6 > 0. By theorem 5.22, 
1 


["Baola-§ {f'n} 


We now see from theorem 6.33 (for T > 00) and its analogue (for 

6 0) that = Ss = 1 
—tz { —nt oe ewes 
3 te ex, Jat x ae | 

Next we explain a widely applicable method of obtaining asymp- 
totic expansions of functions expressed as definite integrals. 
Theorem 14.85. (Watson’s lemma.) Suppose that for —A, < t < A, 

I (t) 2g ~ a,t", 
and that | f(‘)| < Be (b > 0) for - A, where 0 < A < A. Then, if 
|phz | < Sa 31 =, 20 
F(z) = { e~* F(t) dt 
0 


odcnt 


has the asymptotic expansion 2 ntl” 


450 APPLICATIONS TO SPECIAL FUNCTIONS [14.8 


We are to prove, in fact, that substitution of the power series for 
f(t) and integration term by term gives an asymptotic expansion for 
F(). 


Proof. Given N, there exists C such that 


< Cit%e™ for all ¢ > 0. 


N-1 
fO- ay” 


If we now write 
F(z) 


N-1 
py eta, t"dt+ Ry 


N-1 ! 
a,n: 
aig ” cari + Ry; 


the theorem will be proved when we have shown that Ryz%*? is 
bounded. If z = x+iy, 


ea) N-1 
Rul = [fe {AQ "E ayer} 
0 0 
< . ext CN ebt dt 
0 
CN! 


= (x —b)**1 
Since |phz| < 427-6, we have x > b as soon as |z| > bcosec 0. 
With that restriction on z, 


CN} z|N+ 
R N+1 =e O 
Rn Sn oe 


ifx > b. 


The same method proves the following extension. [fa > —1 and 


[e.8) 
f(t) = t* Da,t" for 0 < t < A, (with the same restrictions on | f(t)| 
0 


and |phz| as in the theorem), then F(z) = |  e-stfi (t)dt has the 
0 


asymptotic expansion 
a3 san a, (a+n+1) 
ze 0 gurl 

We now establish the asymptotic formula for Log (1 +z). In it 
the interpretation of the Logarithm is the value arrived at by analytic 
continuation from the real value taken for real positive z. 

The constants B, are the Bernoulli numbers defined in exercises 
14(b). 
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Theorem 14.86. In the sector —47+06 < phz < 47-6 the Junction 
Log ['(1 +z) has the asymptotic expansion 


Boy, 


1 Se 
(z+4) log z— Tee > | Qn —1)2n Bt 


Proof. Applying theorem 14.85 to the integral in (3) of theorem 
14.84, we find (noting that B, = 1) that the asymptotic expansion of 


r- 
F(z)—— 1s aa. Dag 
= -sat p> Fini” 


By theorem 14.83, —F(z)+ logz+A+t so has the asymptotic ex- 


ansion 
P = Ban 


where A is a constant of integration, to be determined. 
Appealing again to theorem 14.83, we have as the asymptotic 
expansion of 
Log [(1+2z)—(¢+4) log z+z—Az—A, 


. Bon 


the series 2 (2n—1)2n 22"-1° 


The constants A and A, can be determined from the value found 
for log (n!) in theorem 14.81; we have A = Oand A, = 4 log 2z. | 


Notes. (1) The result of the theorem remains valid for 
—m7+06 < phz < 27-0. 


Our restriction to the smaller sector arises from the corresponding 
limitation in theorem 14.85. 

(2) To determine the multiple of 27i included in Log I'(1 +z) we 
should have to identify the branch derived by analytic continuation 
from the principal branch, which is real for positive z. In the calcula- 
tion of ['(1 +z) there is no difficulty, because any value of Log I'(1 +z) 
will determine it. 


NOTES ON CHAPTER 14 


§14.6. The definition of ! commonly called Gauss’s was in fact given by Euler 
and rediscovered by Gauss. The Weierstrass definition had been used by F. W. 
Newman (1848). 
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§14.8. If we substitute numerical values of the B;,, the asymptotic formula for 
Log I'(1+z) begins with the terms 


1 1 1 1 1 
1 = Se 
Greet 77 S08 eae ieee ena ee 
It is of great use in numerical work. It is often called Stirling’s series, but a more 
accurate description would be de Moivre’s form of Stirling’s series. It is astonish- 
ing that such a formula could have been found and used correctly (for real z) 


as early as 1730. (See Jeffreys and Jeffreys, Mathematical Physics, chapters 
15 and 17.) eas — 
pecans 


SOLUTIONS OF EXERCISES 


1(a) 
2. Use theorem 1.11. 


4. (ii) (AUC)—(BUC) = (A-B) NC, and 
(A—B)UC = (A-B)NC’ =Ce= oe. 
The opposite implication is obvious. 
5. (i) (AUB)—-B=A-B. (ii) (A-B)UB=AUB. 
6. (AU B)—(An B) = (AU B)N(AN BY = (AUB)N(A'U B). 
7. (AAB)AC = (AUBUC)N(AUBUCIN(VUBUC)JN(AUBUC) and 
this expression is symmetrical in A, B, C. Hence 
AA(BAC) = (BAC)AA = (AAB)AC. 
8. (ii) (A U C)A(BU C) = (AAB)N C’ and (AAB) UC = (AAB)NC’ > C= @. 
9. AAB = CAD = (AAB)AB = (CAD)AB = A = (BAD)AC 
> AAC = (BAD)A(CAC) = AAC = BAD. 
10. A—B, B—A, AN B are pairwise disjoint and 
= (A—B)U(ANB), B= (B-A)U(ANB), 
AU B= (A—B)U(B-4A) U(AN B). 
Hence |A| = |A—B|+|An Bl, |B| = |B—Al+|An Bl, 
|AU B| = |A—B|+|B-A|+|An BI. 


1(5) 


1. Let {{a}, {a, b}} = {{c}, {c, d}}. If a = b, then {{a}, {a, b}} = {{a}, {a}} = 
{{a}} and so {{c}, {c, d}} has one element only. Thus {c} = {c,d} and so c = d. 
Then {{c}, {c, d}} = {{c}} anda =. 

If a + b, the above argument shows that c + d. Also {a} = {c} or {a} = {c,d}. 
Since {c, d} has distinct elements, {a} + {c, d}. Hence {a} = {c}, i.e. a = c. Then 
{a, b} = {c,d} and so b = d. 


2. (i) xe(AUB)xC+x =(y,z), where yEAUB, zEeCexeEAxC or 
xEBxC+xe(AxC)u(BxC). 


3. A= B>AxB= BxA.If A + B, we may suppose that A has an element 
x not in B. If ye B, (x, y)¢ Ax B, but (x, y) ¢ Bx A. 

Take A = {a}. Then (AxA)xA = {((a,a), a}, Ax(AXA) = os (a, a))}. 
These are different since a + (a, a). 


4. {a}x {a} = {@, a)}. See 1. 


5. (i) ye f(U A) + 9 x € UA such that y = f(x) = ye f(A) for some AEE => 
yeU f(A). 

(ii) ye f(NQ A) 3xXxENA such that y = f(x) = ye f(A) for all AEG > 
yet f(A). 
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If fis injective, y = f(x) for-at-most one-x € X. Hence y € f(A) forall AcE => 
3xe€)A such that y = f(x). 

If f is not injective, there are distinct x, x’ € X such that f(x) = f(x’) = y, 
say. Take A = {x}, B = {x’}. Then f(AN B) = f( 2) = 2, f(A) nf) = $}. 


6. Proof similar to that of 5 (ii). 


7. Gi) x ef-(N B) = f(x) ENB=f(x) €B for all Be =x ef-(B) for all 
Be€E >xeE) f(D). 


9 G@xEeA=fHef(A=xef((A)). 

(ii) ye f(f-(B)) = 9 x € f-(B) such that y = f(x) = J x such that f()eB 
and y = f(x) > yeEB. 

If fis injective, then f(x) € f(A) > x € A; hence f-(f/(A)) = A. 

If f is not injective, there are distinct x, x’¢ A such that f(x) = f(x’) = y, 
say. Take A = {x}. Then f-(f(A)) = f-“({y}) > {x, x}, so that f-*(f/(A4))—- 
A+ @. 

If f is surjective, then y¢ B > J x such that y = f(x); hence f(f/-(B)) = B. 

If f is not surjective, Y—/(X) + @. Take B= Y—/f(X). Then f(/-(B))= 
f(@) = @. 


10. f not surjective = f,0/f, not surjective. f, not injective > f,0 f2 not in- 
jective. Hence f, is bijective and so are f, 10 (fo f,) =f, Gaohjof ? = Ss. 


11. The argument shows that xRy for some y implies xRx. But xRy need not 
hold for any y. (E.g. X = {x, y}, R = {0, »)}) 


1 (c) 
1. By theorem 1.43, D = C—A is countable. By theorem 1.42, A contains a 
countably infinite subset E. Since EU D is countable (theorem 1.45) and infinite, 
EU D ~ E. Then 
AUC=AUD = (A-E)U(EUD) ~ (A-EQUE=A. 


BnC is countable. Since B is uncountable, so is B—C = B—(BNC); in 
particular, B—C is infinite. Hence, by first part, 


B-C ~ (B-C)uU(BnNC) = B. 


2. (i) (0,1) ~ @, db): let y = a+(b-—a)x (xE (0, 1), yEG, d)). 

(ii) (0,1) ~ (2,00): let y—a=(1/x)-1 (xe, 1), ye(a,~)). Clearly 
(=o a) oes (—a, 00). (4), 1) <6 (=; 00): let 7 = x/A+ |x|) we (—o, 00), 
yet 1,1). 

(iii) (a, b) ~ (a, bj etc. by 1. 

3. If is countably infinite, then it is of the form {A,, As, ...}. An ~ [n—1, 7); 


let f,, be a bijection on A, to [n—1, n). Then f: U A, —> [0, ©) given by 
n=1 


f(@&) = f(x) if xe A, 
is a bijection. Argument similar when @ is finite. 


4. Let xe A. Clearly {x}x A, ~ A, and.so {x}x A, is countable. By theorem 


1.45, A, x A, = U ({x}x A,) is countable. Now use induction. 
re Ay 
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5. If Z, is the subset of F which consists of the sequences with infinitely many 0’s, 
then F, ~ (0, 1). For (p,, po, ...) € Ay may be associated with the unique binary 
representation 0.p,p,... (not using recurring 1’s) of a real number in (0, 1). 
Also A—F, is countable, since, for each n > 1, there is only a finite number of 
sequences with p; = 1 fori > n. By 1, 7 ~ (0, 1). 

Define f:72-F as follows. For p = (pi, Do,---)> 4 = (Gir qa, --J EF, 
F(p, 2 = (Pi; 91» Pos Yas «»-). Then f is bijective and so FY? ~ FY. This implies 
(0, 1)x (0, 1) ~ (0, 1) and, by 2, A, x A, ~ R*. Now use induction. 


6. Let S be the set of integers; A the set of all polynomials with integral coeffi- 
cients, A, the set of those of degree n. Since A, ~ S”*1, A, is countable and so 


c 
is A = UA,,. Each polynomial equated to 0 has a finite number of roots. 


nN 
7. 0 <i< 8: In the decimal representation of the members of R; replace 
i+1,...,9 by i, ..., 8 respectively. We get the set of nonary representations of 
the numbers in (0,1). Thus R; ~ (0,1). i = 9: Some decimals of R, have 
recurring 8’s; they form a countable set C. R,—C ~ (0, 1) and so R, ~ (0, 1). 


8. Let E, = En {0} and, for k = 1, 2, ..., put 
E, = En(k,0), E_.~.= En(-©, —k-). 
Each E; (i + 0) is finite, for otherwise a series of distinct terms of EF; could 


lo @) 
be formed and this would diverge. Hence E = U E; is countable. 


ee 


9. (i) Let A = {a, ..., an}. Let F be the set of all ordered sets (py,..., Dn) of 
0’s and 1’s. PF has 2” members. With S € 4 associate (pj, ..., Pn) €F defined by 
= is if a,e S, 
Ps NOt a €S. 
Clearly G4 ~ F. 
(ii) Now let F be the set of 5. As in (i), Ya ~ F. 
(iii) Suppose that A ~ Su, ie. that there is a bijection f: A > “44. For'xe A, 
put f(x) = S,. Define S* € Ya by 
xeS* ifandonlyif x¢€8,. (7) 
Since S* € .Y4, there is ae A such that S* = S,. Then (ft) with x = a becomes 
. aeS*=S, ifandonlyif a¢S, = S*. 
10. For each interval J of the collection .% choose a rational xz € J. The xr 
are distinct. Hence ¥ is similar to a subset of the set of rational numbers and so 
is countable. : : 
Suppose / is increasing. If c (not an end point of J) is a discontinuity, put J, = 
(f(c—), f(e+)). When c, < co, f(cx+) < f(ez—) and so the J, are disjoint. 
Hence the set of J, (and so of the c) is countable. 


2(a) 
1. (a) No; p(x, —x) = 0. (6) Yes. 


rth 
2. Tf pf), 7) = 0, then [ (¢—yW) = 0 for all x and 0<A< 1; thus 
x 


y 
I (¢—-y) = 0 for all y. Differentiate w.r.t. y: 6(y)— YQ) = 0 for all y. 
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3. y= x in (ii) gives 2p(x, z) => p(x, x) = 0. z= x in (ii) gives p(x, y) < 
p(y, x); interchange x, y to get p(y, x) < p(x, y). 


4. [S1n—S1l, pees [one Sel < hie x) < lE12—$1|+ eee + |Enx—Snl. 


5. Since ° P(Xnz Yn) S P(Xn, X)+ P(X, VY) +P Yn), 
P(Xns Yn) — P(X, Y) S P(Xn, X) + PC, Yn). 

Similarly P(X, Y)—PUXn, Yn) S P(X, Xn) + P(Vny Y). 

Hence |P(x, Y)—P(%ns Yn)| < P(x, Xn) + PCY, Yn). 


6. To prove M3 put p(x, vy) = a, p(y, z) = 5b, p(z, x) = c, so that a+b—c = 0. 


Then 
(a+ b—c)+ab(2+ Ds 
a(x, y)+o(y, z)—O(x, Z) = (1+a)(1+5)(+ ae 


p, o are equivalent, for if p(x,, x) > 0, then 


_ Pn; X)_ 
Bte  olors 
and if o(x,, x) > 0, then 
O(Xn; X) 
P(%ns X) = 7 a ai 


7. Example: on R!, p usual metric and o as in 6. Then o(x, y) < p(x, y), but 
a(x, y)/p(x, y) = 1/1 +|x—y|) +0 as x >, 


8.0 <p < 20 < 2p; use 7. 

p(x, a < a gives the disc D with centre a and radius 1. a(x, a) < 1 gives the 
square P with centre a and sides of length 2 parallel to the axes. T(x, a) < 1 
gives the square Q with centre a and sides of length ./2 at 45° to the axes. 
P> D=.0. 


9. The sequence (x,) converges in (R!, 7) if and only if x, is constant for all 
sufficiently large n. 


10. If hE C, h(x) = O for 0 < x < 1 and A(z) > O for some ce (0, 1], then 
1 

I h(x)dx > 0. For h(x) > 4h(a) in an interval [a, 5] < [0, 1] and 
0 


{, h(x)dx > f h(x)dx > th(a)(b—a). 


This proves M1 for o and 7. M2 is obvious and so is M3 for T. 
To prove M3 for o adapt the argument for the usual metric in R”: M3 is 


equivalent to : 
Sor <([#) (0) © 


and this follows from the fact that 


1 
iE {EA(x) + W(x) dx > O 


for all real &. 
Tt < o follows from (*) with ¥ = 1. 
In (C, 0) and in (C,7), fi > 9, where Ox) = 0 for 0 < x < 1. If V/,) con- 
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verged in (C,p) to f, say, then, since o(f,, f) < PU Sf); Sn would have to 
converge to fin (C, 7) also. Thus f = 9. But p(/,, 9) = 1 for all n, Le. f, 7 7] 
in (C, p). 

In (C, 7), Zn > 9. If (g,) converged in (C, ©), it would have to converge to 6. 
But o(g,, 9) = 1/J3 for all n. Also convergence in (C, p) would imply conver- 
gence in (C, 0). 


11. 0, < oO, < 03 < NO, < NO, 


2(5) 
2. If V is a normed vector space and p is defined by p(x, y) = ||x—yI|, then 
p(ax, A) = |jax—4l| = |lax|| = |||]. 


Hence, if V has an element other than the zero element, then p(x, y) takes all 
non-negative values. 


3. The condition for ||.|1, ||.||, to be equivalent is that |x,||, > 0 if and only if 
| xnll2 > 0. 

Suppose that there is no # such that ||x||, < “|x|, for all xe V. Then, for 
every integer n, there is an x, such that 


|Xnle > allxnlla 


Xn + 9, since otherwise ||x,||, = 0. Let 
Xn 


bee gaa ge 
n||Xnll1 


Then ||y,, = 1/n 0, but ||yn\|, > 1 for all x. Hence there is a # so that ||x||, < 
ul|x\|, for all x and clearly # > 0. Similarly there is a x > 0 such that |x|; < 
k||x||, for all xe V. 


4. If an inner product exists, 
lxtyl? = @&+y).@+y) = |x|? +x.yt+y.x4 |p, 
so that 2re(x.y) = ||x+yl]?—||x|]?—[>I)- 
Similarly 2im(x.y) = ||x+iy||?— |]x|]?— >I). 
6. x.06+x.6 = x.(0+9 = x.8. 
7. Take R? with ||x|| = max (|x,|, |x2|). If x = (1, 0), »y = ©, 1), 
jx+yl|l?+lx—yl? = 2, 2([||?+ [yl = 4. 
(Or ||x|] = |x| +|l-) | 
8. [Xn-Yn—X-¥| < [Xn-On—Y)| + |Gen—x)-y| 
< ||xnl| l¥n—y| + | %n—~l] [I]. 
9, Proof of N 2. Theorem 2.21 clearly holds in this situation. Then 
x+y]? = @t+y).et+y) = |x|? +2re(x.y)+ [ly]? 
< |x||?+2|x.y|+ [ly]? 
< (|x| +y/). 
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2(c) 


1. X contains a countable subset S. Denote the (distinct) members of S by 
A, X1, Xe, .... Let p(x, x) = 0 (all xe X), p(x, y) = 1 (& + y; x, y not both in S), 
P(Xm, Xn) = |m-1—n-|, p(a, Xn) = n-!. M3 requires checking. 


2. If c is a limit point, there is a sequence (x, such that p(a, x,) < r (all n) and 
Xn > c. Since p(a, Xn) > p(a, c) (exercise 2(a), 5), p(a, c) < r. 


3. If the supremum (infimum) did not belong to the set, it would be a limit point 
outside the set. 


4. Let c be a limit point of E*. Given ¢, B(c;¢)—{c} contains an x € E*. Let 
0 be such that B(x; 0) < B(c; €)—{c}. Since B(x; 6)—{x} contains a point of E, 
so does B(c; €)—{c}. 


5. Each open set contains an open ball which contains a point with rational co- 
ordinates (cf. exercise 1(c), 10). 
No. If R” is given the discrete metric, every set {x} is open. 


6. (i) K is the union of all open sets G < E. Every GC E° and so Kc E°. 
K > E’, since E° isa G. 

(ii) Hi is the intersection of all closed sets F > E. Every F > EandsoH > E. 
H © E, since E is an F. 


7. For any x e€ X, there are three exclusive and exhaustive possibilities: for 
every €¢, Bix;6)N E+ @ and B;6)NE’ + 2; for somee, Bax; gNnE= @; 
for some ¢, Bx; e)N E’ = @. 

xé(E)’=x¢E-<= for somec, B(x; 6)N E = @ + for somee, B(x; 6) < E’= 
xée(E’). 


8. By 7 (2nd part), E’ = (E°). Use (2.31). 


9. (i) xE E,U Ey U E, <> there is a sequence (x,) in Ey U E, such that x, > x = there 
is a sequence (x,,) in EZ, or in E, such that x, >x<+xek, or xe Boxe 
E,U E,. Use induction. 

For arbitrary collections, the right to left implications hold. Also, if in R! 
(usual metric), E, = {1/n}, VE,—-UE, = {0}. 

(ii) x €() E = for every ¢, B(x; €) contains aye QE = xeE for every E = 
xENE. 

If, in R’, E, = {1,4,4,...3, A = {-1, -4, -4,..,40B = 0, BE, nk£, = 
{0}. 

Gi) (2,90 ..0E) = £n...0nf,;01 BD = ie ey eve 


10. Use the sequence definition of limit point. For E° use 7 (2nd part) or in- 
variance of open sets and 6 (i). 


fi Ccope Feo Pe C=C. 


3(@) 
I. fo me) = m/(1 +m’). 


act (Xn); (x/) are sequences in E— - {xo} with limit Xo, thet’ GO) = Gy, X,. Xm 
x3, ...) is of same kind. Since limf(x’) exists, lim/(x,) = limf(x’,). Apply 
theorem 3.1. 


CHAPTER 3 459 


3. Given e¢, there is 6 such that o(f(x), yo) < € whenever 0 < p(x, Xo) < 0 
and xe E. There are infinitely many such x (x, is a limit point of EZ) and so 
B(yo; €) contains points of f(E). If fis injective, Bo; €) contains infinitely many 
points of f(Z) and so does B(; €)— {Yo}. 


4. Take e. There is 7 such that 0 < p(y, yo) < 9 and yE E => 7(g(y), Wo) <é. 
There is 6 such that 0 < p(x, x.) < 6 and xe D = o(f(), yo) < 9. If f is 
injective, there is at most one x for which f(x) = yo. Then 0 may be chosen so 
that 0 < p(x, Xo) < dand xe D = 0 < o(f(%), yo) < 7%. 

Example: D = X = E= Y = (0, 1],f@) =00 < x < 1), g(0) = 0,20) = 
1(0 < y < 1). g(y) ~1lasy +0, but hx) =00<x< l). 


5. By theorem 3.1 (necessity), if (x,) is any sequence in E— {x} with limx, = Xo, 
then P(xn) > &, W(xn) > B, f(%n) > Vor &(%n) > Zo aS n> 00. By exercise 2(), 
8, D(Xn)f (Xn) + VXne(Xn) > Mot fhZ and (3.11) follows from theorem 3.1 
(sufficiency). 


3(b) 


1. fcontinuous on X. Let x ¢ f—*(F). There is (x,) in f“(F) such that x, > x. 
f(x) €F. Since f is continuous at x, f(xn) >f(x) and, since F is closed, 
f(x) € F. Thus x €f—(F). 

f not continuous on X. If f is discontinuous at x, there are « > 0 and (xn) 
such that x,>x, but o(f(x,),f(x)) 26. Let S = {f(%), f(%2), ..-}. Then 
f(x) €S and so x ¢f-1(S). Thus f-(S) is not closed. 


2. Suppose f continuous at x». If N is open and f(x») € N, there is € such that 
B(f(x»); €) < N. There now exists 6 such that x ¢ B(x);6) implies f(x)é 
B(f (x); €). Take M = B(x; °). 

Converse. Given ¢, take N = B(f(x,); €). There is an open set M such that 
x,.¢M and f(M) < N. There is 6 such that B(x,; 6) < M. 


3. Define fon R! by f(x) = 0 (x rational), f(x) = 1 (& irrational). Take E to be 
the set of rational numbers. 


4. Continuity of f obvious. Let y be a limit point of Y and let y, > y, where 
Yn + y. Then f0») +f“), for otherwise f(y) would be a limit point of X. 

Take X¥ = {0,1,2,...3, Y = {0,1,4,4,...} amd use metrics induced by R’. 
Take f(0) = 0, f(n) = 1/n (a 2 1). 


5. p,o equivalent = (X, p), (X,7) have same open sets + J, J~-* continuous 
(theorem 3.23). 

Let p be the discrete metric and o the usual metric on R‘. Then J: (R’, p) > 
(R!, 7) is everywhere continuous, but J! : (R', 7) > (R’, p) is nowhere continuous. 


6. G = (2, 3), # —{0, 1). 
8. If p is the usual metric in R”, 


mae 110) —f(x)| < pf), f(*)) < Dl fi%o) — fx) |. 
3 


Let f: R” > R” be linear and have matrix (@;;).f; is continuous at xg, since 


[fi(x0) — fix)| < (lan|+...+ |@im|)||xo— xl. 
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9. If w is any one of the equivalent metrics on Xx Y, 


w((x, Vo) (Xo, Yo)) — p(x, Xo), W((Xo; y), (Xo, Yo)) i o(y, Yo). 
For the example take f: R? > R' defined by 


ee [F when (x, y) + (0, 0), 
0 when (x, y) = (0, 0) 
and Xo = Yo = 0. 


10. Let ¢,(x) = (1—x)” for 0 < x < 1. Then |f(@,)||/||d,| 


11. Let a = (q@,...,a,). By Cauchy’s inequality, |f(x)| = |a.x| < |la|| |x|, 
for all x € R". Also |f(a)| = |lai|?. Hence | f|| = ||al|. (@ = @ included.) 


ae ee < |g [fC] < |e | f] |x||.f-of is the identity function J on U 
an an; 

Let U be R® with ||x|| = max (|x,|, |xo|), let V be R? with norm |x| = 
|x1|+|x2|, let f: U > V be defined by f(x) = x. Then 


nN, 


‘|x| + |x9| 


a. % max(|x,|, |x2|) 
max(|x,|, |x2|) 


Aaa + |xz2| 


|f] = sup = 2 Fo = see = 1. 


3(c) 
1. If 7 is metric on E induced by CX, p) or (Y, o), then (EZ, 7) is a metric subspace 
of CX, p) [(Y, 7] and so E is connected in CX, p) [(Y, )] if and only if (£, 7) 
is connected. 


2. Let G:N GaN {x} = B, G,U G,> {x}. If xe G,, then x ¢ G,, i.e. G,N{x}= o. 
Let E contain at least two points and let x be an isolated point of E. There is 
6 such that B(x; 6)n E = {x}. Take G, = B(x; 40), G, = (G,)’. 


3. X connected = E, E’ not both closed (all E+ 9, X)= E¢Eor E’¢ E’ 
@lik + ¢, XyofE =£0E + 2 (ll E+ o,. Xx). 
4. Let S be set of irrational numbers and let E(< S) contain x, y, where x<y. 


There is a rational r such that x <r< y. Take G, = (—~,r)nNS, G, = 
(r, O)N S. 


5S. If G is open and Gn E + @, then GN E + @. For let xe GN E. There is 6 
such that B(x; 0) < G and B(x;0)n E+. Hence, if E is not connected, 
so that there are G,, G, with 


GE +2, G.NE + 2) GNGnk =2. G06; se 
then Gf #8, GE +e, GAGE = & 6 ees. 
i.e. E is not connected. 

No. Take E = (0, 1) U C1, 2). 


6. If C is a component of a closed set F, then 
C is connected. Since C > C and C is maximal, 
No. See 4. 


7. f: (0, 1] + R? given by f(x) = (x, sin (1/x)) is continuous. Since (0, 1] is 
connected, by theorem 3.34, so is E = {(x, y)|0 < x < 1, y = sin (1/x)}. By 5, 
E = Ev {(x, y)|x = 0, —1 < y < 1} is also connected. 


C < Fand. C<f.= F. By 5, 
L. == &, 
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3(@) 
1. If (x,) is any sequence in E—{x,} with limit x», then lim f(x,) exists. Apply 
Y | me a®,2) 

exercise 3(a), 2. 
2. s, ein R‘. Hence (s,,) is a Cauchy sequence in R! and so in Q. 

0 < qg\(e—s,) < ge i (*) 

Gel assy q 

Ife = p/q,q!e is an integer (as well as q!s,) and so is q!(e—,). This is false by (*). 
Hence e is irrational and (s,) cannot converge in Q, for if it did (s,) would have 
two distinct limits in R?. 


3. Let ¢,(x) = (x"+1)"" 0 < x < 2). (Y,) converges in B[0, 2] to ¢ given by 


1: doe OO &..x <I, 
P(x) = | 
oe eee Se eee 
for p(¢,,, 6) < 2(2¥"—1). Thus (¢,,) is a Cauchy sequence in B[0, 2] and so in D. 
Since ¢ ¢ D, (¢,) cannot converge in D, for if it did (¢,) would have two limits 
in B[O, 2]. 


4. (n) is a non-convergent Cauchy sequence since n/(1+7) > 1, but there is no 
real x such that x/(1+ |x|) = 1. 


5. No. Let p be the usual metric on R! and o the metric of 4. Then p, o are 
equivalent; for f: R! > R} given by f(x) = x/(1+ |x|) is continuous and strictly 
increasing, so that f(x,) > f(x) if and only if x, > x. 


6. (i) € is a collection of complete subsets E of X. PutH= ) E.IfH= @, 
Heé 


then H is complete. If H + 2, let (x,) be a Cauchy sequence in H and so in E. 
There is an x € E such that x, > x. Limits in X are unique. Hence x is the same 
for all E, and xe H. 

(ii) We need only consider two complete sets E,, E,. Let (x,) be a Cauchy 
sequence in E,U Ey. At least one of E,, E,, say £,, contains an infinite sub- 
sequence (x,,). Since E, is complete, there is an x € E; such that x,, > x. Since 
also (x,) is a Cauchy sequence, 


P(x, Xn) < p(x, Xn) + P(Xyqs et. 


7. Let x,€F,. If mn > Mo, P(Xms Xn) < P(Fr,) and as p(F,) +90, (Xn) is a 
Cauchy sequence. Since (X,/) is complete, there is an x such that x, > x; 
and since each F, is closed, x € () Fy. If x’ € 1 Fa, then p(x, x’) < p(F,) for all n 
and so p(x, x) = 0, 10.00 & 

(i) X = (0, 1], F, = (0, 1/n] (closed in X). 

Gi) X = RL Ff = ©, 1/nl. 

(iii) X = R}, F, = [n, ©). For a counter example with bounded F,, let X = 
C[0, 1] and let F,, be the (closed) set of continuous functions f on [0, 1] such that 


f0)=0, 0< f() <1in(0,1/n), f(*~) = 1 in [1/”, 1). 


Then p(F,) = 1 and()F, = 2, since there is no continuous fsuch that f(0) = 0 
and f(x) = 1 in U[i/z, 1] = (@, 1]. 
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8. Px) = 2x"O<x< 1), x) es 


P(gn, ) = 1 for all n. 
9. Let (¢,) be a Cauchy sequence in L(V, W). For any xe V, 
| P(x) — Pa(x)|| = (Pm— n(x) < |'Pm—Pnll ||| 
and so (¢,(x)) is a Cauchy sequence in the complete space W. 
Define 6: V > W by $(x) = lim ¢,(x). Then 
Plax + Py) _ Plax + Py) =a {2Pn(x) + fry} = a(x) + f(y), 


i.e. d is linear. 

Since (¢,) is a Cauchy sequence, ||¢,,|| < A, say. (Adapt first stage of proof of 
theorem 3.42.) As |d(x)|| = lim |¢,(x)| < Allx|, by theorem 3.24, ¢ is con- 
tinuous. 

Given ¢, there is ) such that |¢,,—¢,| < ¢ for m,n > no. For any xe V 


BnS # > ||P) — ba(x)|| = Tim bmx) — ba(2)| < €||x|. 


Thus ||¢—¢,|| < € and so ¢, > ¢. 


10. [a,b] is complete and /: [a,b] > [a,b] is a contraction mapping since 


fy) —F2)| = [FE — %2)| < kl xy — x9]. 
11. Define 2 on Bfa, 5] by 
QYY(x) = WX)—-f&, ¥))/M (Ye Bla, 5), x € [a, 5). 
Since w(x) is bounded for a < x < b, so is f(x, w(x)); hence Q(y) € Bia, 6]. Also 
ILRWVE)— 2G DOO] = [[YiGd)— ACO] — A.C, w)/M)| 
< |¥u@)— ¥2(~)|(1 — m/M), 
i.e. POQMV?), OG/2)) < A —m/M)p,, Ye). 


Thus Q is a contraction mapping and there is a unique ¢ such that Q(¢) = ¢, 
i.e. f(x, (x)) = 0. Moreover, there are no unbounded solutions. The argument 
at the end of the proof of theorem 3.47 shows that ¢ is continuous. 


12. Put M = [xo—&, x» +], N = [yo—5, yo + 5] and define DQ on C(M, N) by 
QI) = yor [LG WOME (re CM, N), xe M), 
Qc) is continuous (C1, theorem 7.61) and, for x e M, 
ILRWA)—yYo] < |x-x9|B < «B< b, 
ie. O(W)(x) € N. Also 
lOM D(x) — OG) < | ik If Yul) - FG, Og 


< 


S aAp VY, ey) 


| ” Altes) — Wald) |e 


and so 2) is a contraction mapping. 
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13. (i) Q(@)—x = 1/(1+e*) > O for all xe R*. (ii) For all distinct x, x2, € R’, 


x) - 9G] = | aad + (Se -Fas) 
eé 
(%1— 2) (1 = a) < |x1— Xe]. 
14. Since a fixed point of Q is a fixed point of Q%, Q cannot have more than one 


fixed point. 
Let x be the unique fixed point of Q”. Then 


QMO()) = QCQ°%@)) = QO), 
ie. Q(x) is a fixed point of QO”, Hence Q(x) = x. 


15. Take $,(x) = 1, $.(x) = 0 on [0, 1]. Then p(y, Ge) = PQ), Q2)) = 1. 
By integration by parts, 


(2G), AY)) = su _ [, * (x— GO) —VO}dt < 404, W. 


3(e) 
1. Let (Xo, %>) be a subspace of (Y, ) isometric with CX, p). Since X, is closed, 
by theorem 3.43 it is complete in (Y, 7). (Xo, *) @ restriction of o to 5 ae 
is a completion of CX, p). 


2. Take X = set of non-negative rational numbers, Y = [0,°), Y, = [1, 0), 
all with metric induced by R?. 


3. For any x € X*+, there is a sequence (x,) in X convergent to x. (Q(x,)) is a 
Cauchy sequence in X, since 


P(Q% nm), 2% n)) < KPC%ms Xn); 
and so has a limit in X+. If also x, > x (x, € X), then 
PLUK), 20%;,)) < ke(Xns X,) > 0 
and so lim Q(x’) = lim Q(x,). Hence define Q* on X~ by 
| Ot(x) = lim OC,). 
The restriction of Qt to X is Q, since, when x € X, a sequence from X con- 


verging to x is (x, x, ...). 
Qt is a contraction. If x, ye X* and xn > X, Yn > Y (Xn, Yn © X); 


POC), 20 n)) _ P7 On), LOW)) _, PTO*E), O*0)) 
P(Xns Yn) oe ys) ra) 


Since I.h.s. < & for all n, r.h.s. < k. 
Q* is unique. Any continuous f: X* > X* which extends © satisfies f(x) = 
lim Q(x,,) (x € X*, Xn € X, Xn > X). 


| 3(f) 
1. A subsequence (x,,) of a sequence (x,) converges in (X, p) if and only if it 
converges in CX, 7). 
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2. Let E be a compact set in (X, pe) and let (x,) in E converge to x € X. There 
is a subsequence which converges in E. Since every subsequence converges to 
x, x € E. Use theorem 2.31. 


3. Let ¢,(x) = 1 for x = 1/n, 0 elsewhere in [0, 1]. If (?,,) is any subsequence of 
(Prn)> Pr» Pv) = 1k + 1) and so (¢,,) does not converge. 


4. If x, +> x, there are a subsequence (x,,) and an € > 0 such that p(x,,, x) > € 
for all k. By compactness, (xy) has a convergent subsequence (x, i" Since (xp,,) 
is a subsequence of (x,), Xy, > xX Contradiction. 


5. (i) F closed, E compact. Use theorem 3.63: Ef F is a closed subset of the 
compact set E. 


Let & be a collection of compact sets E. Since each E is closed, H = () is 
Heé@ 


closed. If E* is any member of E, H = E* f H is compact. 

(ii) We need only consider two compact sets E,, E,. Let (x,) be a sequence 
in £, U E,. At least one of E,, E,, say E,, contains an infinite subsequence (x,,). 
There is a subsequence (Xp,,) of (x,,) and so of (x,) convergent in E, and so in 
E,U Ey. 

6. (X, ~) compact, (F,) contracting sequence of non-empty closed sets in X. 
Let x,€ Fn. (Xn) has a subsequence (x,,) converging to an x ¢ X. For any n, 


co 
xy, € F, if y» 2 n and since F, is closed, x € F,. Thus x€ () F,. 
n=1 


7. Takee > 0. Thereis b > asuch that | f(x)—/| < 4¢for x = b. When x,, x2 > 
b, | f(x) — f(x2)| < ¢. Also fis uniformly continuous on [a, b+ 1] and so there is 
d6(0 < 6 < 1) such that | f(x) — f(x2)| < € when x, x, € [a, b+ 1Jand |x,—x,| < 
é. 


If |x,—x.| < 6, then x,, x,¢[a,b+1] or x1, x,€[b, 0) (possibly both). 
In either case, | f(x) — f(x2)| < ©. 


8. fnot uniformly continuous: if d > 0, (x+0)?—x? >a asx>om, 

g,h,k uniformly continuous: |g(x,)—g(x2)| < |x1—~x2|; for h use 7; for k 
use argument similar to that in 7, since |x?—x3| = |x,—x2|/(x}+x}) <e 
if |x,—x,| < 1 and x, x. > b, say. 


9. There is 0 such that |f(x,)—f(x.)| < € when x, x2 € [a, b] and |x,—x,| < 6. 
let @= Xo < %, <...5 2, <%= 8, wee <i. == 
Define h on [a, 5] by putting 


h(x;) = I(x) (i _ 0, i, seey n) 


and letting h be linear in each [x,_,, x,]. If x € [x;_1, x,] and x = (1—@)x;_,+ax; 
(0 < a < 1), then 


|Ax) —f()| = |1-%) fi) +a f(x) —f()| 
< (1—2)|f(%i-1) -—f()| + 2| fe) -—f)| < e. 
10. fis uniformly continuous, for 


f(Ps)(x) —f (92) (x)| sia Kc Po) 
and so p(f(¢1), f(¢2)) < PCP, $2)- 


< [ tiv <tae 
0 
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g is continuous. For let ye C[0, 1]. Then [y(x)| < A, say, where A > 0. 
If p(y, x) < 0 < A, 3 


Ie@)-20@l = | re “—-w+)| < 824 +8) 


and so p(g(v), e(x)) < 3Aé. But g is not uniformly continuous. For if y,(x) = n, 
Xa(x) = nt+1/n (0 < x < 1), Pas Xn) = 1/n, while p(gn), &(Xn)) 2 2. 


11. There are sequences (x,), (v,) in E such that p(x,, y,) > P(E). (x,) has a 
subsequence (x,,) which converges to a point xe€ E. (y,,) has a subsequence 
()»,,) which converges to a point y € E. Since x,,, > x, p(x, y) = p(E). 

Take X = B[0, 1] with usual metric, and E = {¢,, ¢o, ...}, where 


1-—1/n when x = I1/n, 
P(X) ~— { ; 
0 elsewhere in [0, 1]. 
Since p(¢,, $n) = 1—1/n (m < n), a sequence of distinct points of E cannot be 
a Cauchy sequence. Hence E has no limit points and so E is closed. p(£) = 1, 
but there are no m, n such that p(¢,,, ¢,) = 1. 


12. inf p(x, z) < inf {e(x, y) +p, z)} = ep, y)+ inf pQ, z), 
2eH zek zek 

i.e. p(x, E)—p(y, E) < p(x, y). 

Similarly p(y, E)—p(x, E) < py, x) 

and so |e(x, E)—p(y, E)| < pe, y). 


p(x, E) = 0+ every B(x; «) contains a point of E= xe FE. 


13. There are sequences (x,) in A, (y,) in B such that p(x%;, yn) > P(A, B). 
(x,) has a subsequence (x,,) converging to a point ae A and (),,) has a sub- 
sequence converging to a point be B. Then p(a, b) = p(A, B). 

In R" take A compact, B closed. For a suitable finite, closed interval J, p(A, B) = 
p(A, Bn J). 


14. (i) A compact, B closed. Suppose p(A, B) = 0. Take (x,) in A, (y,) in B 
such that p(x, yn) > 0. (xn) has a subsequence (x,,) converging to an x€ A. 
Then y,, > x and, since B is closed, x ¢ B. Thus ANB + @. 

(ii) In B[O, 1] take A = {d}, where d(x) = Ofor0 < x < 1, B= {4,, dao, ...}, 
where 


1+1/n when x = 1/n, 
Pn(x) = 
A is compact, B is closed (cf. 12), p(A, B) = 1, p(¢, ¢,) > 1 for all n. 


Bb. A= FA = 2,3, 3,8 = 2+4,3 +4, ...}. 


0 elsewhere in [0, 1]. 


3(g) 


p 
1. Let E be a set in CX, p). Suppose E < U B(c,; 1). If a is an arbitrary point of 
n=1 
X and r = max (a, c,), then E < Bla; r+1). 


1<n<xp 
Let X be an infinite set, o the discrete metric on X. Then X is bounded in 


(X, 7), but not totally bounded. 
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2. Let E be a set in CX, p). E can be covered by a finite number of open balls 
with radius 1/k. Hence there are infinitely many members of (x;_, ,) lying in 
one such open ball. 

If m,n > No, Xm.m and Xn, lie in an open ball of radius 1/n) and so 
Pian Xn,n) se 2/no. 


3. Suppose f: CX, p) > CY, ©) continuous, (X,) compact. Let « > 0. Given 
ce X, there is <d(c) such that o(f(),f(c))<e when xe Bc; d(c)). 
{B(c; 46(c))|c € X} is an open covering of X. Let {B(c;; d(c;))|1 < i < n} bea 
finite subcovering and put 6 = min{d(c,), ..., O(c,)}. Let plu, v) < 46. If 
ue B(c;; 40(c,)), then u, v € B(c;; O(c,;)) and so o(f(u), f@)) < 2¢. 
4. The set of (x—d(x), x+0(x)) (@ < x < f) is an open covering of [a, /]. 
There is a finite subcovering corresponding to x, ..., Xn, Say. Suppose 
Cas © 2. < 3... = tas = ee 

Put d(x,) = 6,, (x,—46;, x;+6,) = 1; We may suppose that no J; contains an 
I, (i + j). 

For i=1,...,n—1, J; and J,,, have a common point )y; such that 
Xi < Vi < Xi41. (Otherwise there would exist z such that x;+6; < z < x%i41—9i41 
and zeJ;, where 7 <i or j > i+1; and then J; ¢ J; or [,4, < J;.) Hence 


f(x) < f(xix). But f@) < f(x), fxn) < f(A). Hence f() < f(P). 


4(a) 


1. O. (a) Yes. (6) No: x, = 2p. 
o. (a) No: x, = G", 01, = G)". ©) Yes. 
~. (a) No: xn = (4)"+(", 2p = G" (0) No: x, = 14+(1/n), t» = 1. 


2 O84< [Se- ao ges 
; vn EF ~ (k=-1)n 1 . 


a] ™4 
Gi) BF —< 1+ [" —dx = 1+loga. 
n 
(iii) If -1 < k < 0, zr < 1+ |" x'dx. Ifk > 0, Sr® < nn". 
1 


3. Un—Un = {(Un/vn)— 1}v, > 0. Counter example: u, = n+1, vz = x. 


4. (Xn+1—Xn) increases. Hence the bounded sequence (x,) either decreases or 
eventually increases. 


5. (i) For |x| < 4, say. 
llog(1+x)—x| = |—4x?+4x9-...] < 4]x|?(1 + |x] + |x]?+...) < [x]? 


(ii) a > O: (*) +>Q andso 


=) (+a) < K(|x|?+|x|®+...). 


a < 0: by above, 


(1+x)* = : 


f—oxrFO0G) = 1+ax-—O(x*)+ p2 {ax — O(x?)}*. 
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6. The inequality follows from the fact that, in [r—4, r+4], the curve y = 1/x 
lies between the tangent at (r,1/r) and the chord joining (r—4, 1/(r—4)), 


(+4, 1/¢+4)). 


fr po [nO (p) = 08(2-a59) +09) 
ats 22:15 

n+il n+4 xX s 2n+1 
7 1 1° 


7 
nt — Se ins Poe “ieeiee. 


<5 
1 


4 (5) 
1. If the upper and lower limits are finite, x,/v, is bounded by theorem 4.21. 
Converse obvious. 


2. lim sup x, = © <> sup x, = © < given K>O, there is nm such that 
Xn > K = there is (,) such that Xy > ®. 


3. (i) Nothing to prove when lim inf x, = — oo. Otherwise 7 en), (v,) bounded 
below and inf x, < x, < y, for all s > n, so that inf x, < inf y,; let n > 00. 
TZN T2>N s2n 


4. infx,+infy, < x,+y,(alls), soinfx,+infy, < inf(x,+y,). X.+ Ys < Xs+sSupy, 
(all s), so inf (x,+y;) < inf (x,+sup y,) = inf x,+sup y,. 
For < everywhere take 


(xn) = Gy ~1, 1, ands ey * (vn) = (=, 1, 0, 0, =A, 1, 0, 0, ae 5 
5. (Xn) iat (i, wm 2 1, 25 ES aads (vn) =i (3, 1, ay 3, 3 e2 rare» 


6. (i) (a) Let A = lim inf x,. There is a subsequence (x,,) such that x, >A. 
Then x,Vu, > Ay. If x,y, > ¢, then, since y > 0, x, = yy,)/y,, > cy. 
As c/y = A, c > Ay. Hence no subsequence of (x,,y,) has a limit <Ay and so 
Ay = lim inf enV: 

(6) If lim inf x, = 0, then x,>0o and x,y,7>0. If liminfx, = —o, 
there is (x,,) such that x,, > —0o; then x, y,, > —©. 

(ii) x, = n (n odd), x, = 1 (nm even), y, = —1/n make (a), (d) false. 

(iii) If y, > 0, then x,y, > 0. If y, > y < 0, use (i) (@), (c) and (4.23). 


7. Let lim inf x, = A. Given € > 0, take 6 > 0 so that |f(A)—f(x)| < € when 
|A—x| < 6. Since x, > A—O for n> Mo, say, f(Xn) = f(A—98) > f(A)—e for 
n> no. Similarly f(x,) < f(A)+e for infinitely many z. 


8. First inequality: Let lim inf (u,/v,) = A. Nothing to prove if A = — oo, 
If A finite and e > 0, u, > (A—e)v, for n > N, say. Then, 


Sa Se See I ge a> A= Je 

Te t i, 
forn > N’(>N), say, since t,, > oo. Thus lim inf — > ALIfA = ©, u,/v,>00 
and, by similar argument, s,,/t, > ©. 


9. (i) Un = Xn, Vn = 1 in 8. (ii) uw = log x,, vz, = 1 in 8; use 7. (iii) For 
n > 1, replace x, by x,/Xn_, in (ii). 

10. G) uy = n*, vy = 1, on = n**—-(n—1)*41-(n > 1) in 8. Gi) x, = nin 9(iii). 
(iii) x, = n”/n! in 9 (iii) gives lim n/(n!)¥" = lim [1+(1/n)]" = e. 
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11. If lim g,(6) = 1,, lim g.(6) = 4, then lim max {g,(4), g2(0)} = max (A, /2). 
6—0 6—0 5—0 


12. Let f be upper semi-continuous in [a, b]. If f is not bounded above, there 
is a sequence (x,) in [a, b] such that f(x,) > ©. (xn) has a subsequence (x,,) 
with limit ceé[a, b]. Then ps bes f(x) = © > f(c) (suitably modified if c=a 


or 5). Similar argument ion ‘that sup f(x) is attained. 


13. By the analogue of theorem 4.21 (ii)(c), Df(c) > 0 implies that f increases 
at c in the sense of exercise 3(g), 4. 

If Df(x) => 0 and e>0, D{f(x)+ex} > 0. Thus f(x)+ex increases, Le. 
f(a)t+ea < f(f)+ef when a < a < # < BD. True for alle > 0. 

Counter example: f on [0, 2] defined by f(x) = x 0 < x < 1), f(X) = x- 1 
(i= x ea). 


4(c) 
1. X(1/n?) converges, X(1/n) diverges. 


2. Lup, LUV,, Where 


(4)" (n on -{* (n odd), 
: xf ia (n even), 


3” (n even). 


ww 


. By 4(d), 7 and 9, 


lim sup a@t--+40)!n = glimsup(+--+un)in < giimsupmm < J, 


4. (i) converges, (ii) diverges. 


lin (n!)vn n 1 
= _ 10. 
(1 — in)" a a ee ores 
6. |x| < 1. 
7. |a| < 1 and all 5, |a| = 1 and |d| < 1. 


4(d) 

1. If lim inf (u,/v,) = A, lim sup (u,/v,) = A, then 4Av, < uz < 2Av, for n > 
N, say. 
2. (i) Convergence for p > 1 and all g, p = 1 andg > 1. 

(ii) u, = n708l0e" < n-* (n > N); Lu, converges. 

(iii) U, = n108108108": convergence. 

(iv) u, = e-908l0g™* > g-logm = n- (n > N); divergence. 

(v) n-1-9//n-1 + 1; divergence. 

(vi) U, = O(A/n?). 


a nN 
(vii) u,/n-* = ( +) +> e*: convergence for a > 1. 


a 1 1 
(viii) ~ oa} — —+ ol; ); convergence for p > 2. 
Un+1 2n 
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Gx Es (ett > logan > 05 Oa bof ke 89) (Tn) 
+> —loga > 0. Hence, when 0 < a < 1 or a > 1, series converges for k > 1. 
When a = J, series converges for all k. 


(x) {e908 — 1}/{(log n)/n} > 1; convergence for k > 1. 


KR (1-= 28") [a> n® exp [tog (1-8) 
ey ee -n 


é 1 1 2 2 
= n* exp {n| -* PB" +0( =e) || = Oxp {o (727)! > 1. 


a 1+ — ae. Lu, converges for 0 < a < I/e, diverges for a > I1/e. 


4 


n 


For a = I1/e, acess 3 (1 +) . We show that 
a n 


1 1\* pe £41 
(4) . (147) ltl (+) 
so that Xu, diverges by theorem 4.41. To prove (*) it is sufficient to show that 


n+1 
1+) decreases (to e). We have 


“+1 e+1 
41 tPSSfeeles Tie ca 
dx = 2 x x 


5. (p— 1). + pup t+ p'u,z2+ ... Fp"Uyn) 
S (Uy t... + Uy) t+ Up t... Upp... + Ugn +... + Ugnti_y), 
(p—1)@p+ puz2+ p23 +... +p" Un) 
S (Ugt... Uy) + Upgr t 0. H Up?) +... F(t. $2. EUpn). 
an-* convergent <> X(2*-1)-" convergent < k > 1. Use induction. 


6. X(u,/s,) diverges, for, when p is sufficiently large, 


Un+y Un+p _ Sntp— Sn > 4 


4 ee a 


Snt1 Snip Snip 


k > 1. The inequality is equivalent to f(x) = (kK—1)x*—kx*-14+1 > 0 for 
0< x < 1(&% = 5y_,/S,). This holds since f(1) = 0 and f(x) < Ofor0 < x < 1. 


co 
7. Let 2s ig = Ph, 80 thata, = rare Take 0g sfrny—<lra. 
n+1 


nT 
8. vb, = Un/Sn, Where Ss, = >)u, (use 6). 
1 


10. Take n, = 0 and ny, ng, ... so that 
Wit ..-tiyn, > 1 and w,/i.,<+4 for n> n, 
Us nsite-tuen, > 1 and Ugn/lgan <4 for n> ny, etc. 


Define u, = Uj,n for m <n < ny,1. Clearly Xu, diverges. Also, for every k 
Un/Uz,n > 0 aS n > ©, for, When 1 > Nysi, Un/Unn < 1/24. 


b 
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To construct the convergent series, take ny = 0 and n,, m2, ... such that 
Uanitin > 2 for n>m and UsansitUeniet... < if. 
lis ailing > 2° for n> wp and Ue na tigngat... < 1/2 ete 


Again define u, = u;,,, for ny, <n < Me+1. 


4(e) 
1. (i) Divergence, since a!” > 1. 

(ii) Convergence for x = km (kK = 0, +1, +2,...), otherwise divergence. 
We may take 0 < |x| < 7. Then there are infinitely many n such that 
|nx —(2r+4)7| < 4|x|, for some integer r, and so lim sup sin mx > cos 4|x| > 0. 
Also cos 1 < cos (sin nx) < 1. Hence (sin nx)/cos(sin nx) +> 0. 

(iii) &(2n)!/n?" converges by theorem 4.32. 

(v) Absolute convergence for |x| + 1 (theorem 4.32), convergence for 
x = —1, divergence for x = 1. 


(vi) £ (x1) = 5 (1-log x)x'* < 0 for x > e. Use theorem 4.53. See 


also 4(d), 2(x). 
(vii) Convergence for g < 0 and all p,q = Oandp < 0. 
(viii) Use theorem 4.54. 


sin nx COSHX . xX 


sin 
log n n 


converges for x + 2km (k = 0, +1, +2,...) and diverges for x = 2k7, since 
sin (2k7r/n) 2kr 
log n ~ hn logn 
(x) For « + 2km (k = 1,2,...), Zu, converges. For « = 2k7, Xu, diverges 
since (2k7)"" + 1 and 1/(2k7) < 1. 


cos = converges for all x by theorems 4.53, 4.54. & 


. Thus Xu, converges for x + 2k7. 


2. (i) All x, by theorem 4.53. (ii) x = k7(K = 0, +1, +2,...) only; use 
sin?nx = 4(1—cos 27x). 


3. Series converges for all x, by theorem 4.53. Also 


4 ‘ 
E(i+d+..+%) Sm a o(14 5+... a2) Heesom 
2 n} on 2 - A 


diverges when x + kz. |sin nx| > sin?nx. 


4. (i) If Xa,n* converged for k > 1, a,n* would be o(1), i.e. a, would be o(n~*) 
and so Xa, would converge absolutely. 

(ii) Write a,/n® = (a,n-“)n*-* and use theorem 4.54. 

Let A be the set of a for which X(a,/n*) converges. A + @, since a, = o(1) 
and so «€ A when @ > 1. Also A is bounded below, by (i). Let e, = inf A, 
so that « ¢ A when a < @,. By (ii), «¢ A when @ > &. (@) may or may not 
belong to A.) 


’ n n n 
5, Let a,—/ = b,. Then s, = >)(—1)b,+/>)(— 1)’, where }}(-—1)"b, + ¢, say, 
1 1 1 


n 
and />)(—1)" = —/ (” odd) or 0 (7 even). 
1 
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6. Extension of theorem 4.53: If s, = =. and x |U,—U,41| are bounded and 
v, > 0, then Xa,v, converges. Both extensions follow from 
An+WUnti1t --» FantpUntp 
= (Sn41— Sn) (Ong1—Onte) +... + Snen-1— Sn) Onsn-1— Unt-n) + (Snip — $n) Vp. 


Since f’’(x) > 0, f’ strictly increases. If f’(x,) > 0 and £ > Xo, then f’(E) > 0, 
f(x) = f(&) for x = & and f(x) > 0 as x >. Hence f(x) < 0 for x > 1, 
ie. f strictly decreases. Since f(x) > 0, f(x) > 0 for x > 1. As —f’ is positive 


and decreasing and | (—f’) exists, 5) {—/’(n)} converges. 
1 1 


Put v, = f(x”) sin {log f(m}. Then v, +0 and X|v,,,;—v,| converges since 


[Unt1— Un S 2|\f')| S 2|f'(~)|. 


7. If lim inf (x,/v,) = A > 0, then, for n> N, say, u, > 4Av, and so Lu, 
diverges. 

(i) up, = (—1)"/yn, v, = A/n. 

(ii) u, = 1/n when n = k4 (k = 1, 2, 3, ...), up, = 1/n? otherwise; v, = 1/n. 


n 
8. To prove that 2, Tar > 0 as n->oo, let X|u,| = o, sup|7,| = 7. Given 
é > 0, there is N such that |7,| < € and 2 A < ¢ for n> N. For n> 2N, 


\UoTn + ..-+UnTo| < |UoTn+...+Un—yTn| + |Un—yarTy—it --» + UnTo| 
SK E(|Uo| +... + [Up_yl|)+TCUn_yaal +... + [Unl) 
< (7+7)€. 
Example. u, = v, = (—1)"/,/(n+1). Then 
—Won-1 = ae Se eb ae ge ae 
VQn) /2./Qn—1) nan (Qn) 


1 
9, Let y, = (145+...+7) —logn, so that y, >y (Euler’s constant). Then 


Sn, = {log(2pn, Ee 1) + Y2mm—1} 2S: Z{log(pn ae 1) + Yon- = Z{log An *f Ven: 
10. Since the terms of the rearranged series tend to 0, we need only consider 


Son. Let p, q be the number of positive and negative terms in s,,. Let k > 1. 
Take ¢ > 0. If n > N, say, p > nand |ra,—1| < € for r > 2g. Then 


Son — (A, — A, + eee — Aon) 
= (don + dontit ... + dap—1) + (Gogt2 + Aog+4t ».. + Gon) — Gan + Ganset ... + Aoy—a) 
1 1 1 
=(+e)(5- ee eee ;) +40+6) (5+. + ‘)- -40+6) (7+... gers —), 


where le,|, lés|, lés| < € 


11. By the absolute convergence of =(—1)"/n* with k > 1 and by 9, the series 
converges if k > 1. For k < 1, the series diverges. Obvious when k < 0. 
For 0 < k < 1, 


n 
== 3 lee Oa? ne 


1 1 {> 
2 OS rk 
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4(f) 
1. (i) All z. 
(ii) z = O only. | 
(iii) R = 1, since n” < (nlog?n)'"" < n3" for n> N, say, and n> 1. 
Convergence for |z| < 1. 
(iv) R = 1. When z = cos 9+isin 9, series is 


= (= no ;sin =) 
3 \log’x logn)” 


X(sin nO)/log?n converges for all 6, X(cosn6)/log?n except for 0 = 2km 
(k = 0, +1, +2, ...). Power series converges for |z| < 1 except z = 1. 
(v) |z| < 1. 
(vi) |z| < 1. 
(vii) « = 0: all z.@ + 0: |z| < 1,2 + 1. 
(viii) |z| < 1, z + 1, 
(ix) R = 2. Find real and imaginary parts of (cos 3n0+isin 3n8)/(1 — in); 
convergence for |z| < 2 except z = 2, —1+iV3. 
(x) a = kw (k = 0, +1, +2,...): all z. @ + km: R = 1, since 


0 < lim sup |sin na| < 1 


(see exercise 4(e), 1 (ii)). When z = cos +i sin 9, series is 


Z n n 


Ms 


— n(a+0)+sin mea A), jeosme—O)— 00s mer 


Real component converges for all 0, imaginary component except for 
@ = 2kn+«a. Power series converges for |z| < 1 except z = cosa+isin&. 


2. @) lf a 0, 1; 2,,,. and n> 2, 


(I ICnea)|-F5a~ 14S +0(@). 


Use theorem 4.42. 
(ii) Since [1+(1/n)]}* = 1+k/n+O(1/n’), (*) holds if O< k<1+a. As 


(*) | decreases and 


1/n* +0, inequality (4.41) shows that G) >0. Also 


(cv 


(iii) (*) = 


a 
(") | . Use theorem 4.53. 


3. R = 1. If the series is Xa,z”, 


a, _(ntiI(ct+n _ ite C+ 0(5) 
Gee 3 (a+n)(b+n) = n n?}° 


The argument of 2 shows that, if ao (a+b) > 0, Xa,z" converges absolutely 
for |z| = 1 and, if —1 < c—(a+5) < 0, it converges conditionally for |z| = 1, 
z+1. When c—(a+b) < —1, Ya,z" diverges for |z| = 1, since a,+>0. 
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Obvious if < —1. If c = a+b—1,0 < a,/a,., < 1+(A/n’?) forn > N and, for 


> N. 
—— to A [yet : ae 
og . a ee 
i.e. i, > a,c. 


4. Gif R= S, 7, > R. Whena, = 1, 5,.= —l(allna, R=S=1,7, =. 
(ii) If a, = 1+(—1)”, 6, = 1-—(— 1)”, then R= S = 1, T, = &. 
(iii) If a,x" = 14+2x%4+2x?+4+..., Db,x" = 1-—2x+2x?-2x7+...,R=S= 1, 
is = we, 


5. Xa,z" converges for |z| < 1, since |a,z"| < a|z|". Let |z| < 1. 


a-2)(Sae") - ay = 


(aay i)2" < Ya -1—4,)|z|". 


a, = oO, =... ES 0 = ged Gn > Gr, 


co 
r.h.s. < ¥ (Gn-1—@n) = Ao—lim an < ao. 
1 


ioe) 
Lastly, >) a,z" = 0 implies |—ay| < ap. 
0 


4(g) 
1. (i) Non-existent, —1,1. (ii) Non-existent, non-existent, 0. (iii) All 0. 
(iv) 0, non-existent, non-existent. (v) Non-existent, 0, 0. (vi) 0, 0, non-existent. 
2. Min = Hy Xn = -0-otherwise. 


3. Suppose Xm, increases, lim X¥nn = Ym, lim ym = y. Since Ym Increases, 


nm—>o m—> 
Xmn < Vm < y. Hence, as m+, Xmn—>2Z, < y. Also Z, increases and so 
Zn >Z < y. Similarly y < z. 


4. Sufficiency. For n,n’ > 10, |Xnn—Xn’n’'| < €. By completeness of Z, Xnn > x, 
say. If m,n, n’ > No,|Xmn—X| < |Xmn—Xnin’| + |Xnnr —X| < 2e. 


5. Let SUD Xmn = X,X,, > X—€. Form > p,n 2 Vv, xX—€ < Xmn S Xz 


m Nn cO 
6. Smn = = D wy < des < DY 043 use 5. 


4=1 j= 4=1 j=1 L,pel 
7. Adapt the proof for }) u, (C1, 90 and 95) using 6. 


n= 


ioe) 
8. ¥ a <<) exists if and only if k, / > 1; use theorem 4.74. 


m=1 


9. k > 2: (m+n)-* < (2mn)-**; use 8. 


b= 2: Y (m+n) > iz (m+x)~?dx = (m+1)" 


and so > {z >) (m+ n)- ) diverges; use theorem 4.74. 
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10. oS i ha =) > lua, 


i=l j=] t=1.4=1 
cO 
> Umny § = p> ica 
m,n=1 m,n=1 


Take € > 0. Then |Smn—s| < €, |tmn—t| < ¢ if m,n > mo, say. Let q > 21 
and let p = [4q] (the greatest integer < 4q). Then 


Saq— Dy Ur| = m,n<q M,n<q 
r=1 mM+n>Qq MmM+n>q 
q 
and so |s— >) v,| < 36. 
r=1 


1. If w= a4+if, wo =a’ +if’, af + a&’f.|mo+no’|? = am?+2bmn+ cn’, 
where a > 0,c > 0,ac—b? = (af’—2’f)? > 0. Then A(m+n)? = |mw+no’|? > 
Bmn, where A = max(a, c) > 0, B = 2(,/(ac) +5) > 0. Use 8, 9. 


12. Let |z| < 1. ie = |z|/"/(—|z|") < |z|"/(1—|z|) and oD ( 3 lem 


cO 
exists. Hence >) |z””| exists. 
m,n=1 


Let x zm = gs. Since 
m,n=1 ; 
zi +2"/1 —2") = 24+ 2(zr 4 zten se |), 
wl 6 ES, 


E (Er)o8 (5m) - ane 8 Oe 


m=1 n=1 


Dp 

The second term is >) z"/(1—z”). Also, as p > ©, the first three terms tend to 
n=1 

s; hence so does the last. 


13. Let |z| = r < min(R, 1), r < p < R. lim supla,|"”" = lim sup|d,|4" = R7 
and so |a,|, |b,| <p" for n> N, or |a,|,|b,| < Ko for all n. Hence 
x|5,| |z”"| converges and 


oe l-r"/p ~ p=-r 


oc [e.e) 
so that >) ( » lam |bn| jem also converges. By theorem 4.74, 
m=1 \n=1 


S [ ctm)| - S| 5 aha 
m=1 \n=1 n=1 \m=1 
and this is the desired result. 

The disc |z| < min(R, 1) cannot be replaced by |z| < R. For if R < 0 and 
|z| > 1, |z|" > R for large m and for these f(z”), g(z") are not defined; while 
if R= ©, Yang(z"), Ub,f(z") ay diverge if: |zic> 1 xz. a, = 4, = Wnt. 
gmx > I). 
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5 (a) 
Rt MX) = = o(f,(x), f(x)), then M(XU Y) = max(M(X), M(Y)). 


2. fr&n > fg uniformly if (/,), (g,) are uniformly bounded (i.e. |f,(%)|, [gn()| < K 


for all x, n), since 
) F8—Take = fle—2,) +edl—S,)- 
If (/,), (gn) not uniformly bounded, counter example is 


f(x) = &(xX) = x+(1/n) on (—, ©), 


; : ee ee n 
3. f(x) = 0. Convergence uniform, for f, has maximum (=) re at ree 
or, given ¢, |f,(x)| < € on [l—e, 1] and |f,(~)| < (1-6) < € on (0, 1—€] for 


n> MN. 
g(x) = 0. Convergence not uniform, for sup |g,(x)—g(x)| > g,(1/n) = 
sin 1 +> 0. 
h(x) = x®. Convergence uniform, for |A,(x)—A(x)| = x?/(1+nx) < 100/n. 


4. f= g = 0.f, has maximum $ at (4). g, has maximum ./(n/2e) at ./(1/2n); 
Or edi) = ¢" > a. 


5. f = 0. In every interval sup f,(x) = 1 for infinitely many zn. 


6. Given gp, there is m) such that all r, with n > ny) have denominator > q; 
then sup| fn (x) —S(x)| < 1/90. 


7. (f,) converges (to 0) on Z; convergence uniform on any bounded subset 
OZ. 
(g,) converges on {0, +27i, +477, ...} U {z|rez < 0}; convergence uniform on 


{0, +27i, +47i, ...} U {zZ|rez < —d} (6 > 0). 
|sin n(x+iv)| = |sin nx cosh ny+i cos nx sinh ny| 
= 4/(sin’nx + sinh?ny). 
Hence (A,,(z)) diverges if im z + 0, (A,) converges uniformly on {z|im z =0}. 


8. No; take s,(x) = nx/(1+n?*x?). Then s, has maximum 1/2 at 1/n. If 
M(6) = sup |f(%)| 0 < 6 < 1), then M(0) > 0 as 6 > 0 and 


0<a<db 
no 


“sup [/0)s,G)| < max (4M), 5, MQ) <¢ 


by choice first of 6, then of n > 1/6. 


9. The partial sums are 1—x”, (1—x”)(1-~x) —1-(-»"]1-—»(14+-x)-1. 
Lv,, UW, converge uniformly. 


= ss oe 
10. I, y) a ee f(x, y) S(%, y)| ene yi +ny) 
11. [ee sin nx dx = | - 90) oa + [ #00 —— dx 
2M te Mie ay 


f(@)=0 and |f,(4)-F()| < 


n 
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12. Let f, > f. Take e > 0. There exists my such that o(f,(x), f(x)) < ¢ for all 
n> n,andxe Xo. Take m,n > ny. There exists x € X, such that o(f,,(X), Vm) < &, 
o(f,.(X), Yn) < €. Hence 


T Ys Yn) S TVs fnlx)) + 1 Fn), £O)) + OF); ful) + On), Yn) < 46. 


Since (Y, o) is complete, lim y,, exists. 


5 (b) 
1. Sum is x/(1 — x?) for 0 < x < 1,0 for x = 1. Use theorem 5.21. 


2. ‘Only if: (f,(—1)) diverges and sum function discontinuous at 1. ‘If’: 
When [a, 5] < (1, ©), sup |f,(x)| = a”/(1+a?") +0; other cases treated simi- 


ax<xr< 


larly. 


3. If ¢(1) + 0, sum function discontinuous at 1. If 6(1) = 0, use method of 
5(a), 8. 

4. (a) n’xe-™™, (b) (—1)"nxe— on [0, 1]. 

5. (i) f.(x) = 1/x on [1, 2], 0 on (n, ©). 


(ii) g(x) = 1/n on [0, n], 0 on (n, ©); or g,(x) = ee on [0, <0). 


6. f = 0. If 0 < p < 1, f(x) increases with x and f,(1) = n/(1+n°); if p > 1, 


f(x) has maximum An —@'P) where n?x?(p—1) = 1. Convergence uniform if 
C= 7p <2. YS, ne. 


71. f=f' =0.f, +f’ pointwise on [0, 1], but (4) does not converge uniformly 
since f-(1/n) = n/4. 


8. f= f’ = 0 and |f,—/| has maximum 1/(2”) at 1/n. f-(0) = 1 for all x. 
9. f(x) = 1 for 0 <x <1, x for 1 = x <.2.. sup [Lids ae _D). 
0<a<1 


5(c) 


‘® Abies 2 = Er = aa uniform convergence (u.c.) on (—©, 0) 
(M-test). 

(ii) Convergence for x + 0; u.c. for |x| > 6 (>0) and on no larger set since 
sup |cosech nx| = oo (use theorem 5.31). 
xr+0 

(iii) Convergence on (—, ae, U[0, 00); u.c. on (—o, —2—<d]U [d, 00) and 


on no larger set since mod = 2 and sum discontinuous at 0. 


res 
(iv) Convergence lor x <0; uc. for x < —d6 and not for x <0 since 
sup |e"*sin nx| > esin 1. 
r<0 


(v) Convergence for x + 2k7; u.c. on closed interval not including 2k7 
(theorem 5.33). Convergence not uniform on open interval with 2k7 as end point, 
for when x = 2k7+7/(8n), 

are : 2n 
GOS FA x ae Hz — 
avi WPF) * net Dae 
(vi) Convergence and u.c. on [0, oo) (theorem 5.34). 
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2. For any n, sup |1/(z?+n?)| = 00, 
zeEZ* 
3. For |x| < 4 use M-test. For 4 < x < 1, =n-? sin nx u.c.; then use theorem 
5.34. For -—1 < x < —4 series is —Xy"n-” sin n(y+7), where y = —x. 


4. For ze BO; 1)— BU; 9), 


= tn: 2 2 
"= ~ ee 
22 1-—z | [l1—z} 4 
Apply theorem 5.33 to the real and imaginary parts of 2a,z”. 
d 9 3 d : nih coll 
5. sup Pa ee = An-*, sup |—n-*| = Cogan : 
—w<g<oo |d x>1+6 |ax 


Use theorem 5.23 B. 


6. (i) If 0 < a < 2, u,(x) has maximum An-* where (2—«)n?x? = a; if a > 2, 


u,({x) increases with x and u,(1) = (1+277)-1. Thus yu, is u.c. on [0, 1] if e > 1. 
2n 2n 

Let 0 < a <1. Then  u,(1/n) = > 1/(2n*) = 4 and convergence non-uniform 
n+1 nt+t 


by theorem 5.31. 

(ii) Consider [1, 00). If 0 < a < 2 and n large enough, w,(x) decreases in 
[1, co) and u,(1) = (1+n?)-. If a > 2, u,(x) increases in [1,00): for a = 2, 
sup u,(x) = n~*; for a > 2, sup u,(x) = ©. 


N 
7. Sx" —x) <' 1; use theorem 5.33. M, > [e(n+ 1)log(n+ 1)]*. 
1 


8. u,(x) = (—1)"x"(1 — x). See 5(@), 9. 


9. (i) For u.c. use theorem 5.33. 

(ii) For any x, u,(x) = 1/(1+27°) if n is sufficiently large. For any interval J, 
sup u,(x) = 1 for infinitely many 7. 
rel 


10. Uniform convergence by M-test. 
x irrational. Each term is continuous at x and so is s. 
x = p/q. : 
(op) ; + n 
: (? : h) ee ee 
1 


q * 


Each series is u.c. (in #) for 0 < h < 1. Each term tends to a limit as h > O+. 
By 5(a), 12 and theorem 5.21 we can take limits term by term. lim [n(p/q) + nh] 
differ (by 1) if and only if m = qr (r integer). 


11. In [0, a], sin (7x”) < 7x" < Ta”; u.c. by M-test. 


1 1 7 
[sin ray dx <n [amd = 
0 0 n+1 


o rl | a i @ a 
So 3; i "sin (7x") dx converges to /, say. Since { < { 2, { < /. Thus, 
1 J0 0 ae ee 


0 


a 


a 
by theorem 5.22, { i= [ 
0 0 
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a 1 
Since [/ f increases with a, i J =- ita f< < /. 
0 


a—l1—- 


1 ae es 
Also % [= [, > increases with N and mg re | See > /—e 

0 01 7 we 
iN > Nz. 


= , if = 0, 
12. > a,(x) = 12 noes S 
see F(n) if x = 1)/n. 


Xu, is u.c. on {0, 1, 4, ...} and each u, is continuous at 0. 


1S y= (") Be pny = re 


14. Given ¢, take a = ay < a, < ... < ay = BD such that 
ff) <« # a, =x fe Gera 
If n > N, say, |f,(a) —f(a)| < ¢ (i = 1, ..., k). Let x € [a;_,, a;]. Then 
|fn(as_-) —f(x)| < 2¢, [f.(a)—f@)| < 2¢. 

Hence f(x)—2e < f,(a_1) < f(x) +26 

and f(x)—2e < f,(a;) < f(x) +2e. 

Since f,, is monotonic, 
f(x)—2e < f(x) < f(x) +2e. 


Thus, for n => N, Bors! ,l filx)—f(x)| < 


5(d) 
1. (i) For z + 0, (sin z)/z = 1—2?/3!+... >1as z 0, by theorem 5.42. 
Gi) For z+ a, (cos z—cosa)/(z—a) = —2sin 4(z—a) sin 4(z+a)/(z—a) > 
—sin a as z >a, by (i) and by continuity of sine (as sum of power series). 
(iii) (e* — 1)/(e*— 1)? > 1 asz>0. 


2. (i) cos(sin z) = 1—422+3%z!+... (all z). 
(ii) eN4+) = e(1—z+$z?—3234 $424+...) (|z| < 1). 
(iii) sin (cos z) = sin 1 cos(—4z?+342z4—...)+cos 1 sin(—4z?+42!-...) 


= sin 1—4(cos 1)z?+ (s¢ cos 1—} sin 1)z*+... (all z). 


3. For x + 0, f(x) = exp[(1/x)log(1+x)]. Since f(0) = e, for |x| < 1, f@) = 
exp (1—4x+4x?-—Aix?4+...). 1-—4x4+4x?-—4x'+... has derivatives of all orders 
at O and so therefore has f. For small |x|, f(x) = e(l—4x+33x?—-7ex+...). 
Thus f” (0) = —4e. 


4. (1) a, xX" bak", Bok tides radii of convergence > 1. Then (Za,x”)(Zb,x”) = 
aut, x" for Lx < 1. Let x > 1- and use theorem 5.47. 
(ii) Ifa, = by = (—1)"*1/J/(4+ 1), then |c,| > 1. 
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5, (i) La, x” is u.c. for 0 < x < 1. Hence { (<a, x9dx = 2 [ ax" dx. 

(ii) Da,x"*1/(n+1) has radius of eacabics > 1 and so had xa,x". Hence, 
ae 0-«- 2 < <1; [ (Za,x") dx = Lané"/(n+1) > La,/(n+1) as §>1-, by 
theorem 5.47. : 


6. By 5, sum of series is 


1 dx 1 i x+/2 x—/2 ) a 


o 1+x* ~~ 8 e+ /2xt+1 x?- J2x+1 
7. Coefficient of x”: 
n—1 —1)r-rt — jt (—1)""2! 1 1‘ (—1i)" 
2 1) 2 eee Serene =) ick : 


r-1 “-r r i get wr Tf 


ta 
Valid for x = 1 as s,_,/n decreases and tends to 0. 


8. ” (") converges, by 4(f), 2. Hence, by M-test, 2 (*) x7 46 01.0: tor 


> 0: 
x <1; the sum is (1+x)* for —1 < x < 1 and, by continuity, for 
1. 


i 
1+ MA 


x 
a> —Il: z(’ \2 converges, by 4(f), 2. 2 (* x is u.c. m [0, 1] by theorem 
5.34 and in [—1+0, 0] by M-test. 


9. By 8, 1a pat gat. .. converges uniformly to (1 +x)? for 0 < x? < 1, 


ie. |x| < 1. Integrate. 


n 
10. Put > (")( p = ¢, For |x| < 1, Ze,x" = & (") ae ® oS = (1+x)*t4, 
fa 7 NaF nN nN 


By theorem 5.43, c, = ‘ee ; 


5(e) 
1. If f is a continuous nowhere differentiable function on [a, b], define g, 
em =1, 2, ...). on fa, b] by 


ie [ Sidewee [ eid Gane =) 


Then g™ = fand so g, is the desired function. 


2. Since Eu,,/n? is u.c., the sum g is continuous in [0, 1]—{r, re, ...}3 at Pn the 
left and right limits exist, but are unequal. Each u, is integrable and so is g. If 


A(x): = ie g, h is continuous everywhere, but is not differentiable at r,, for the 
0 
left and right derivatives differ. Finally, f(x) = h(x)+ Dur,/n’. 
1 
3. Take f(x) = 1/x forO < x < 1. 


4. F< B(O:r), say. Take R > r, Fy = R*'— BOO; R), F, = FU Fo. Let fy be the 
extension of f to F, with f(x) = 0 for x € Fy. Apply theorem 5.52 to f,. 
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5. Given €, choose 6 so that o( f(x), f(x’)) < € when x, x’ € E and p(x, x’) < 4. 
If x, x’ € E and p(x, §) < 40, p(x’, £) < 40, then o( f(x), f(x’)) < ¢. By exercise 
3(d) 1, lim f(x) exists; define f*(£) to be this. 

x—> 


ee 
Let u, u’ € E be such that p(u, u’) < 40. There are x, x’ € E such that 
pix. x) < 6 of), f*)) < & 00) Fe) =e 
Then o(f*(u), f*(u’)) < 3e. 


S3(f) 
1. |f(x)| < M, say, for a < x < b. Take € > O. There exists a polynomial p 
such that |f(x)—p(x)| < ¢ fora < x < b. Then 


*b b b b 
ee [ su-p+ [9 =| f0-p < (b-a) Me. 


Fence 1. f° =v. 
a 
If f +0, there exists ¢[a,b] such that f(€) = 7 + 0. By continuity, 
f(x) > 4? fora < x < f, say. Then 
b B 
[ i > | f? = 4n°(B—a) > 0. 
a a 


2. The set A of trigonometric polynomials is an algebra in C[«, 2] which con- 
tains the constant functions. If @—a@ < 27, A separates points for, when 
0 < |x,;—x,| < 27, at least one of cos x,—Ccos Xs, sin x,;—sin x, is not 0. If 
P—«a > 27, clearly A does not separate points. Then only continuous functions 
with period 27 can be approximated. 


3. The even polynomials on [a, 5] separate points if and only if [a, b] < (—2, 0] 
or [a, b] < [0, 0). 


4. For each x, y,(x) > (x), where y(x) = 4{y?2(x)+1—x*}. By first part, 


y(x) = 1—|x|. Convergence uniform by theorem 5.35. y, is a polynomial 
(of degree 2”). 


6 (a) 
1. Consider Z’ with one point of division added to those of Z. 
2. When k > 0, SQ, kf, g) = kS(Q, f, g) and SQ, —f,g) = -—sx(Q,f, g). 


3. SD, Ff, 21+ 82) = SQ, f, 2:)+ S(Q, f, g2) and D may be chosen so that all 
three sums lie within ¢ of the corresponding upper integrals. 


4. Follows from 3. 
5. SF, fi, 2) < SQ, fr, g). 
6. f(x) = 1 for x rational, —1 for x irrational. 


7. For x rational, f(x) = 1, h(x) = 0; for x irrational, f(x) = 0, h(x) = 1; 
g(b) > g(a). 
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8. If m, M are the infimum and supremum of / on [a,6], then 
mh(x) < f(x)h(x) < MA(x) for a < x < band so 
b b b 
m | hde < | thdg < m{ hdg. 
a a a 


9. Given ¢, there is 6 such that |f(u)—f()| < ¢ if u,v €[a, b] and |u—v| < é. 
When (2) < 46, S(Z)—s(D) < eg (b)—g(a)} and 


b 
s(Z) < [ fdg < S(Z). 
a 
10. (i) |F(v)—F()| < K{g()—g(w)}, where K is the supremum of |f| in [2,6]. 
(ii) If m(u, v), M(u, v) are the infimum and supremum of fin [u, v], 


gv)—g) — Fo)-FW) — Mu ») SO=8H) 
a eee , v-u 


m(u, v) 


11. There is a dissection D of [a+e, b], say até = x1 < Xp <1. < XR =D 
such that S(D’)—s(Z’) < ¢. If D is the dissection a = xX) < X41 < ... < Xn =O 
of [a, b], then S(Z)—s(D) < (M—m)e+e, where m, M are the infimum and 
supremum of f on [a, 5]. 


12. Let c be a common right discontinuity. Any 2 has points x;_,, x; such 
that Ni SCS XR Then 


S(D)—-s(D) = (M,—m){e(x) —e(%i_s)} = |f(e+)-fOl{e(c+)—-g(©)} > O. 
13. For n > N, say, |f,(x)-f(x)| < € (a < x < b). Thus 


7b b b b 
i "fae i "fide < f ea) ae [ Gp de = Biss). 


Also, for n > N, 


b b 
[ files [ fag eo 


14. If f(x) = 0, let sup f(x) = M > 0. There is an interval [«, 6] in which 
f(x) > M-e. eae 
M"{g(b)— g(a)} 2 [ rae ik f'dg > (M—«)"{g(f)—8(@)} 
and, for 7 sufficiently large, 
{g(b)—g(@}i* < 1+¢6, {g(P)—g(@y" > 1-«. 


6 (b) 
z 0 (x-=: 0). 
n 1 1 1. (a, 2 0), 
See ¢ a —-}|= 
ea | tc ai i * ig (up < 0), 
Pin (x = 1); 


also fis linear in each interval 1 -- ,1- ae and f(1) = 0. 
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3. g,(x) = x" on [0, 1], f monotonic on [0, 1] and discontinuous at 1. 


4. f,g, have no common left or right discontinuities and nor have f, g, by 
uniform convergence. Thus fe R(g; a, 5b). Then use integration by parts and 
exercise 6(a), 13. 


6 (c) 
1. Necessity obvious. To prove sufficiency adapt the proof of the completeness 
of R' given in §4.2 (p. 81); or use the completeness of R! as follows. The 
sequence 


at+n 
é = ‘ tde° (v= 1; 2,2.) 
a 
x 
converges. Also i fdg>0 as X>o. 
[X] 


2. Use theorem 6.31; or write f = (|f|+/)—|/| and note that | (f|+/) dg 
a 

exists, since 0 < |f(x)|+/f(x) < 2|f(@)|. 

3. Integrate by parts. 


4. To prove convergence use integration by parts. Also 


(n+4)7| cos x 1 (n+4)" 2 
[oe |cos x| dx = ———_—. 
(n-4)2 |X+sin x (n—3)7 Jn-4)0 (n—2)7 


(n+1)*t1-—n2+ = fat l\ 1 
ee r ya 


; a+] 
as n > ©, since the power series & ( x"! has radius of convergence 1 and 
r 


is therefore continuous at 0. 
(i) Let a > 0. Since f(n+1) < f(x) < f(y) for n < x < n41, 
(n+ 1)%t1— n%+1 (n+ 1)%t+1— n*+1 
a+ 1 a+1 


f(n) 


n+1 
f(nt+l1) < i) x“fOdx s 
and so, for n > N, say 
n+1 
4(n+1)%f(n+1) < 4n*f(n+1) <| x%f(x)dx < 2n*f(n). 
LX] xX # (X] 
For X > N, 4 nf(n) <| x*f(x)dx < 23) n*f(n). 
N+1 N N 
(ii) If a < 0, x*f(x) decreases and (i) may be applied to x°(x*f(x)). 


b 
6. Let f(x) = 0. fdg is bounded and increases as A decreases and so 
ath 


b 
lim Sdg exists. In general, write f = (|f|+/)—[/]. 
h-0+ Ja+h 


If fe R(g; a, b) and g is discontinuous at a, then f is continuous at a. Thus 


ath 
is fide > Hialeah es kn 
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7. iB log x : ( X log x 
dx = lim lim dx| = 0. 
0+ (x+ 1)? X-o \6—>04+ 76 (x+1)? 


8. 0O<a+1<fandf<a+1<0. 


9. By the M-test, there is uniform convergence on [, £] and so, by theorem Bi22 
(or exercise 6(a), 13), 


00 B 
>, x"te-1 log (1/x) dx = [- 


n=0 Ja 


x° log C/x) 5 
dx 
1-x 


Applying the es of theorem 6.33 we first get 


3 [xt og (ia) dx = | ee OE 
0 


n=0 1-x 
and then 
= 1 1 ye-1 Jog (1/x) 
= xnrtre-l ss ee 
2 , to? Pes log (1/x) dx [, ac dx 


10. On (0, Y], e~”” < e*/¥. Also uP [ve wn hag. ms on ye-*l¥ = @, 
= >0 


a ie cos x d= | sin x |+ +[ Sn x | 
: 2 xt+siny Lx+siny 2 G+” 


and consider each term. 


i 
12. (ii) ra = | 1+ me arc tan Xc. 
= _ydx 1 
(iii) sup | 14x aa 
6 (d) 
1. If |f()| < K fora <x <banda<s<t<b, |f(t)—-f(| < K¢t- s). 


2. | |FMI-|fO| 1 < [FO-S(9I. If |FC0|, |g@d| < K in [a, 5], then 
fOe)-fOg(9| = |fOlLe@)-g¢O} +eOSlO-LO} 
< K{|f()-f(9)| + le@@O—g@)I}- 
3. PD, f)+ AGA) = VBS), PAA-AD.S/| = fO-f@:; 
PQ,f) = KVODA+fO-f@s, ADS = KVGN-fO+f@}; 
Pf) = HVA) +fO)-f@}, QS) = HV2AS)-fO) + f@)}. 


b 
4. For any 2, VQ, F) < | |f\|dg. Take D so that SD, f, 2)-0Df, 8) < €. 
a 


For each i, there are A;, “4; such that 


2 fdg = Axg(x) —8%i-v}, 


Vi—1 


L lf|dg = Mikg(X:) —E(%i-1)}. 
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-Then 0 < u;—|A,;| < M,;—m, and so 


b n 
o<| Ifide- 3 


b 
Thus VF) => V@,F)> { |f|dg—e. 
a 
For the last part, use 3. 


Xi 
fdg 


Ti—1 


< TM m)(E)- 8-0) < 6, 


0 (0 < x < I/n), 


_ 0 (x = 0), 
5. f(x) | tie) fee eS. 


x cos (7/2x) (0< x <1); gues | 


6 (e) 
z£ 
1. By exercises 6(d), 3, 4, G is of bounded variation and p¢(x) = [ gt, 
a 


of 07 
G(x) = | g~. Hence, by theorem 6.21 B, fe R(pe; a, b) N R(gg; a, 6) and 
a 


[ fab< = { fet, | fdae= | fe 


2. Since s(F, f, p,) < XUf(E){p.(x:) —pAxi-)} < S(D,f, py), by exercise 6(a), 9, 
b 

Uf (E){Poxi) — po(xi-1)} ~| fdp, as wD) > 0. Replace p, by g, and subtract. 
a 


3. For every n, f, € R(p,; a, b) nN R(q,; a, 6). Now use exercise 6(a), 13. 


b 
4. Choose f,g so that i) fdg +0, g(b)-g@=0; eg. f(x) = cosx, 
a 
g(x) = sin x on (0, 7]. 


5. Let |g(x)| < Kforx > aandletO < f(x) < eforx = X. By exercise 6(a), 12, 
f and p,, f and g, have no common left or right discontinuities. Hence, by 
theorem 6.54, if X, > X, > X, 


X2 X2 
| is fide = | (xX) e(X)—f(Xe(X)— “Is a < 3Ke. 
7. (ii) [ S22 | atcos x)= "Ss (- iy 2 = "S- 1)"*1u, and 


=(— 1)"t!u, converges by the alternating series test. Also, toe 0 - a<T7, 


a 
6 (f) 


1. We need only consider the cases where f,*(x) and f*.,(x) are defined by 
different formulae. For instance let f*(x) = f,(x), so that |f,(x)—A,(x)| < K. 


If fii) = Inui) + K, 
i= —fr(x) 2 > (nai) + K) = n(x) + K) = Mnys(x)— h(x) = 0 
If frs(x) = Mnii(x)—K, so that frii(x)—hnii(x) < —K, 
fri) L(x) > fri) —frlx) = 0 


== d(cos x)| < 
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oe r+ b 
2. Necessity. (i) gives $(b)-— (a) = 0. (ii) [ ‘ fe, 3 AP S +| : + i See 0. 
5 a a Hf 5 


xc+é b 
{ |< v2t°(g); I <7 
x | xr+6 


Since ¢ is continuous at x, V2t°(¢)>0 as d> 0 (theorem 6.44) and so 
$(x)— 9(a) = 0. 

Sufficiency. Take e > 0. By exercise 6(e), 2, there is 6 such that, for any dis- 
section D: a = Xp < xX, < ... < Xn-1 < Xn = Owith MF) < 4, 


[= 9-60: 


b 
zy (E.) {P(%1) — P(%i-D)} — i. if ae <6, | *) 


But, since ¢ has only countably many discontinuities, x;, ..., X¥,-1 can be chosen 


b ; 
to be points of continuity of ¢, so that the sum in (*) is 0. Thus | { a a < €, 
a 


3. g = g*+¢, where 


and ¢ is a function of the kind described in 2. Note that ¢ may be chosen so that 


0 fara <2 €-c, 


SC 
i: for “¢ <x =D. 


g(x) = | 


6 (g) 


1. Follow the proof of theorem 6.71 with two changes. 
(i) In the notation of that proof, 


0 < SD)-S(D) < (M-m){g(x:)—g(%i-v)}. 


(ii) Since g is uniformly continuous, we may choose 6 so that, if0 < v—u < 4, 
then 0 < g(v)—g(u) < é/no. It follows that 


0 < S(D3)—S(B*) < (M—m)nfE/no) = (M—m). 
2. On [—1, 1] let 


1 for —1 
0 for 0 


If 0 is a point of division of 2, S(Q, f, ¢) = 0; otherwise S(Q, f, g) = 1. 


0 for —-1<x<Q, 
< 


1 for Ge x ed. 


f(x) = g(x) = | 


3. Adapt the proof of theorem 6.71. If Z’ is obtained from F by adding new 
points of division in the subinterval [x;_1, x,] of Z, then 


0< VD)-V(D) < v= (f). 


Choose J, so that V(Do, f) > V°(f)—e. Let my be the number of subintervals 
of Dy. Since v; is continuous (theorem 6.44), there is d such that V?(/) < é/n) 
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if 0 < v—u < 6. Let Zs be a dissection with “(D;) < 6 and let Z* be the dis- 
section with all points of division of Z, and Y;. Then 


0 < ViO*,f)—-V(QB3,f) < no(E/no) = € 
andso V°(f) => VQ;,f) = V@G*,fy—e = VD, fy—e = V2(f)—2e. 
Similarly, if V(Z, f) is unbounded, V(Z;, f) > © as 3d > 0. 


4. On [—1, 1] let f(x) = O if x + 0, f(O) = 1. If 0 is a point of division of J, 
then V(Z, f) = 2; otherwise V(Z,f) = 0. 


6. For any dissection of [a, b], 
SD fede aadt DY xd/O)-Ser} = LO af. 
If f(x;) = t,, this can be written 
= fa Jie x fA G1 = bh ae. 


Since f is uniformly continuous, max (t;—?¢;.;) >0 as max (x;—~x;_,) > 0. 
Use theorem 6.72 (i). 


6 (A) 


1. [a, b] may be divided into a finite number of subintervals [a;,, b,] such that f 
is continuous in [a;, b,] except at one of the end points. Use exercise 6(a), 11, 
theorems 6.12, 6.16. 


3. Let &,, ..., &; be the limit points of E. Outside U(&,—«, & +¢) there is only 
a finite number of points of E. These may be covered by intervals of total length 
less than ¢. Hence E may be covered by intervals of total length less than (/+ 1)e. 


4. By 3, c(E“-”) = 0. The proof of 3 is easily adapted to show that, if D is 
bounded and c(D”) = 0, then c(D) = 0. Hence c(E-”) = ... = c(E) = 0. 
5. Let [a, b] > E,U E,. Since yz,, Vz, are integrable over [a, 5], so are 

NE, uE, = (xe, + Xz,)+ |Xe, —Xz,|}, XZ,n£, = (Xe, + Xz,) — |Xz, — Xz,|}- 
Also NE, UE, + XE,nE, = XE, +XE, 
6. By 5, 0 < c(E, U E,) < c(E,)+c(£,) = 0. By induction, c(~£,U...U £,) = 0. 


For the second part, let r,,r.,... be the rational numbers in [a, 6] and put 
E,, — {r ass 


7. h(x) = f(x)—g(x) = 0 except in a set of zero content. Thus (see theorem 


b 
6.83) | A exists and is 0. Hence 
a 


Tb Tb b Tb Pb b ; Tb 
[ir fer fa Bex few fi 
a a a “7 @ a a a 
8. If E = {1, 4, 4, ...}, c(E) = 0; and, if fis assigned any bounded set of values 
on £, then fis bounded on [0, 1] and continuous on [0, 1]—E. 
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9. Let 


fr{x) = 


1/q if x = p/q, where p, g are coprime and 1 <q < a, 
0 otherwise. 


then f, > /f uniformly on [a, b] and each f, is integrable since f,(x) = 0 except 
at a finite number of points. 


10. Take ¢€ > 0. Choose J so that 
b b 
[ fr-e < (9,f) < SQ,f) < fite. 
a a 


Since f(O)-f@ = Uf ENi— Xi) (%i- < $i < 3), 
(2,f) < f(b)-f@ < SQ,F. 
b 
fo-fa)- |. f' <6, 
11. f is not differentiable at 0, since {f(h)—f(0)}/h = sin (log h) oscillates as 


h>0O+. For 0<x <1, f(%) = sin (log x)+ cos (log x); so f’ is bounded 
and continuous on (0, 1], therefore integrable over [0, 1]. Also fis continuous on 


[0, 1] and so, by theorem 6.84, f f’ = fQ)—-fO) = 0. 


Hence 


12. {f(A —fO)}/h = hsin (7/2h?) >0 ash >0+. For0 < x <1, 
f(x) = 2x sin (7/2x*) —7(1/x) cos (77/2x?). 
Thus /’ is not bounded on [0, 1]. But 


1 | 
{ f’ = fQ)—f@) = 1-6 sin (7/262) > 1 as 3>0+. 
8 
13. Let f’ = g = gt-—g-. Then g, g*, g~ are continuous and 


f(x)-f@ = [e = [e- ea = $(x)—(). 


6@ 
H 
1. a,b €[c, d]. Define F on [c, d], G on [x, f] by FG) = [ f, G = Fod¢d. Then 
a 
justify the following steps: 


B B b 
i (fo#)'¥’ = ir G’ = G(8)-G@ = F®)—F@) = [ f 


2. Assume 0 < a < b. When 0 < 6 < X, by theorem 6.92 (or 1) and exercise 
6(a), 8, 


[PLO atx = [Oa [PLO a = Ay Ax) tos, 
ax 


ry x ao 


where A;, Ax lie between the infimum and supremum of fin [ad, bd] and [2.X, bX] 
respectively. 
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3. In the first part f is decreasing. Define f* by f*(a) = f(at+), f*(x) = f(x) 
for a < x < b. Then f® is decreasing and, by theorems 6.83, 6.93 (i), 


2 ce [re = f*(a) ['« = flat) [-« 


b 
4. Adapt the proof of theorem 6.93 (i): begin by considering | fd(g+1,), 
a 


which has an increasing integrator. 
For the last part take fon [a, b] to be continuous and decreasing with f(@) = 2, 


f(b) = 1; g(x) = O fora < x < b,g(b) = 1. Then [ fdg = \and fa) | dg = 0 
fora <6 <5, 2f0re = 3b: 


ee 
5. Let | | 5 | < K. If g decreases, then g(x) = 0 and 
a 


J 


6. If ¢ decreases and |g(x)| < K, then g(x)+K = 0 and 


= eon f < 2K|g(X)|>0 as X70. 
XxX, 


[fete < 2K | fo as X,>0., 
Ad = 


= 
= te(X)+K) | f 
xX 


7. a <0. (2/m)x% < sin(x*) < x* (x = 1); use comparison principle. 


Bees 


pe 
ee | | sin (x*)dx eh u® sin udu, where 8 = (1/~)—1. Since X* > 00 
1 1 


(oa) [o.@) 
as X >, { sin(x*)dx exists if and only if { u® sinudu exists. When 


(2n+1)7 
0<a <1, >0 and | u’ sin udu > 2. When a > 1, # < O and the 


2n7 
second mean value theorem (or 5) gives convergence. 
8. (b) Use (a) and the fact that x/(x+1) is bounded and monotonic. 
10. For y > 0, 


Xe 
| | e~*¥ sin (x/y) dx 
x 


oe 3 
exe | sin(x/y)dx| < 2ye"*14 < 2X7 'e. 


: SS f(x)| < M for xe BEE; . a the notation of thoerem 7.14, if 
Pa fiE+ DLO] < E [LE+ KO -HE+ kK) 

< & [hl < mM 
bac ‘|fE+D-/O| < amM Al. 


3. Let |lu) = 1. [fE+m)—f() — Df(E\(tw)|/||tul]| > 0 as t > 0. Since |tul| =|c| 
and Df(é)(tu) = tDf(€)(u), this implies {f(E + tu) — f()}/t > Df()W). 
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4, By theorem 2.21, |Df((u)| = (Vf)7.u < || V/E©|| whenever |lu|| = 1, 
and equality holds if and only if wu = up. Now |Df(&)(uo)| = sup |Df#(E)@)| 


= || Df(€)||. Use 3. Iu||=1 


5. Let w2+v? = 1. Since {f(tu, tv)—f(0, 0)}/t = u?v, Du»f(, 0) exists. Also 
|f(tu, tv) —f(0, 0)| = |u®vt| < |t| and so fis continuous at (0, 0)”. 
If fis differentiable at (0,0)7, then, since D,f(0,0) = D.f(0,0) = 0, Df(0,0) = 9. 


= |f(tu, tv) —f(0, 0) — O(tu, tv)|/|t| = |u®v] +O as t>0. 


6. Since f(w, 0) = f(0, w) = 0 for all w, D,f(0, 0), D,f(0, 0) exist and are 0. 
But, when w + 0, D,f(0, w), Dz f(w, 0) do not exist. f is differentiable at (0, 0)”, 


since | fl, K)- FO, 0) — O(h, 1)| = |hk| < 402+ K) = 41@, IP. 
7. If A, #, v are linear functions on R™ to R", R®, R"*? such that v(f) = 
(A(h), #(A))*, then 

IfE+H—fO—rA|? = |}6E+4)-—¢E@)-AM)|?+ |YE+H-¥O--@/?. 


= D,fla, b) = ¢@Vve), D.fa,b) = ¢@y') 


and f(at+h,b+k)—f(a, b)—hD, f(a, b)—kD, f(a, b) 
={P(ath)—d@}~bt+k)+ YOtk)—YO}o@ 
—hd’(a)pb)— ko@y'() 
= {hd'(a) + 0(|h|)} (6+ k) + {kyr'(b) + o([k|)} O(a) 
—hd’@yb)—ké¢@y(6) 
=0(|(A, k)||), since Y(b+k)—yb) = of), Yb+k) = O()). 
9. If g: R! > R'is a continuous, nowhere differentiable function (theorem 5.51), 
define f: R? > R! by f(x, y) = g(x) for all (x, y)” € R’. 
10. D.f; (i = 1,237 = 1, 2,3) is clearly continuous everywhere. Hence / is 
everywhere differentiable. Also 
|| Df(x, yA, k)|| = (22+ k? sin?x)? < |(h,k)| for all (h, k)7 € R? 
and || D/(, yh, 0) = || = |G, 0]. 


7 (b) : 
1. Let g: R™° > R' be given by g(x) = |x| = JGit+...+x2). Dig,..., Dng 
exist and are continuous except at 0. Hence g is differentiable except at 0 and 

|f|| = gofis differentiable at £ if Df(£) exists and f(£) + 8@. 7 


2. |f2 =f. is differentiable by theorem 7.21. If |||? is constant, 
© = Di f|?@ = 2f€). Df). 


3. (i) For fixed t>0 define *:X—> R? by (xX) = g(tx) = t*g(x). Then 
D(x) = tD,g(tx) = t*D;g(x), so that D:g(tx) = t*-*D,g(x). 

(ii) For fixed x € X define ¢ : (0, 00) > R? by A(t) = g(tx). 

If f: (0, 0) > R® is defined by f(t) = tx, then ¢ = gof and, by theorem 7.22 
(see note), 6'(t) = Def) )SF'M) = Detx)@). 
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Necessity. (t) = t%g(x) and so at*-1g(x) = ¢’(t) = Deg(tx)(x). Put ¢ = 1. 
Sufficiency. t?’(t) = tDg(tx)(x) = De(tx)(tx) = ag(tx) = ad(t). Thus, for 
t > 0, (d/dt)(t-*d(t)) = 0, i.e. t-*d(t) = C. Since t = 1 gives C = 9(1) = g(), 
g(tx) = P(t) = t7g(x). 
4. (fop)(r, 6) = (r1-* cos 9, r!-* sin 6)7 and so /(fo p)\(r, 6) = (1—a)r1-?*. Since 
also Jp(r, ¢) = r and 
J(fop\(r, $) = Jf(r cos g, r sin ¢)Jp(r, 9), 
Jf(x, y) = Jf(rcos ¢, r sin ¢) = (1—a@)r-*. 
5. If f(x) ee O1 25 ys)? 
a _ Vatys aoe e 
Xi = \VitVety3, X= Vitvetye” X3 Yet Ys 


provided that y,+y2t+y3 + 0, yetys + 0. Hence 
Y = (i, ye, Ys)" |VityetYs +0, yotys + O}. 


1: Y > R? is differentiable, since all partial derivatives exist and are continuous 
on Y. If x = f-{)); 


JF) = 1/Jf(&) = 1/(4x2) = 1O01+y2t+I02+a- 


7 (c) 
1. Let h = v—u and define ¢:[0,1] > R! by ¢(t) = flut+th) CO < t < 1). 
Then ¢ is continuous at 0, 1 and differentiable in (0, 1) with #’(¢) = Df(u+ th)(h). 
Apply mean value theorem to ¢. 


2. If v = u+2z7, f(v)—f(u) = 9. But, for all € € R}, 
|| Df(£)(277)|| = ||(— 27 sin &, 27 cos €)7|| = 27. 


3. Let A = f(v)—f(u)— Df(u\(v—u). Nothing to prove if A = 6. If A + @, 
let w be unit vector such that ||Al]] = o.{f(v)—f(u)— Df@w(v—w)}. As in proof 
of theorem 7.31, there exists 7 in (0, 1) such that w.{f(v) —f(u)} = w. Df(u+Th)(A), 
where A = v—u and so u+TheS. Then |All = |w.{Df(ut+Th(h)— DfWw}| 
< |Dfu+th)— Df@)| |Al. 


4. Let p(K, G’) = 26(>0). If H = U B(x; 4), then A is a compact subset of G. 
K 


we 
Thus f, Df are uniformly continuous on H. (i) Now immediate. For (ii), if 
xe€Kand |Al| < 0, by 3, 


Ifa+h—-f&)— Df~M||/||Al se [Pf — PIO. 


B(x; ||h 
5. Each sequence (D,;/;) converges uniformly on every compact subset of G, 
since 
ID: f'O)— Di f7@)| < || DF“@)— Df’) |. 


Suppose (f”(€)) converges. If x is any point of G other than &, € and x may 
be joined by a polygonal path in G with segments parallel to the axes. Integrating 
the continuous functions D, f? along these segments we find (theorem 5.22) 
that each sequence (f/(x)) converges and so (f’(x)) converges. The same argu- 
ment shows that the convergence is uniform in every closed ball in G. 
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Let K be any compact subset of G and put p(K, G’) = 20 (>0). By the 
Heine—Borel theorem, there are finitely many open balls B(x’; 6) which cover 
K. (f”) converges uniformly on each B(x*; 6) and so on K. 

Take x € G. Theorem 5.23 (A or B) applied to straight line segments through 
x parallel to the axes shows that the D,f;(x) exist and equal lim D, f;(x). 
Since the D; f} are continuous, so are the D; f;. Hence fis continuously differen- 


eens pfs) PP@)l"< EIDAG- DFO! 


Df?(x) > Df (x). 


7(@) 


1. (i) When m =n = 1, existence of Dwf(€, 7) equivalent to existence of 
Df (é, 7) G@ = 1, 2) and this does not even imply continuity. 

(ii) All partial derivatives exist in the open ball (see p. 215) and are con- 
tinuous at (&, 7)" (as on p. 211). Apply theorem 7.14. 


2. D.f(x, y) = x+y" > Ofor allx,y > 0 
Dif(x, ) = y-x7t =O0eyH=xt; f(x,x4) =0<ex=e. If solution 
through (e?, e-#)” is y = ¢(x), 
t Pe p(x) — x7} 
PO) = ~ 5+ 6G) 
Since ¢(e*) = e-? and ¢’(e*) = 0, 6’(e*#) = —he-#. 


3. Let € + 0. By the mean value theorem, f(€+ y)—f(y) = Ef’(y+é), where 
O<a<1. Asyrto, Ef(yt+aé) > +o or Fo according as £20. In 
either case there is an 7 such that f(§+ 4)—/f(7) = f(&). 
If F(x, y) = f(x+y)-f@)—-S0), DoF E, 1) = £'(E+1)—f') + 0, since § + 0. 
Last part: f(x) = 1+ x". 


4. E.g. (a, 0, 0)", (0, a, 0)? when a + 0. 
5. Jin f(x, y, z) = 3xy® and f(0, y, z), f(x, 0, z) cannot be @. 
-+ -4 0 
6 (-t “29, 
b 4 0 


7. E.g. when u,v, w < —1. Jy = Iter > 0, 


5 —X ye® —e"r? 
greg a ama ze" 
xe" —yewt” —Z 


8. E.g. (x, 0,0, v)? satisfies the equations if g(v) = v—sin(cos v) = 0; and 
2(0) < 0, 2(v) >~masv->o., | 
Ja) = —e"cosv + 0, for cose =0>x=0>0=0. 


7 (e) 


1. Counter-example: In R!, f(x) = x’, € = 0. 
If the local inverse ¢ is differentiable, 


I= D(gof\(§) = DO(f())o Df) 
and so 1 = Jp(f(&)).JF(§). 
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2. Jf(x, y) = sin(y—x). For all 0, 0’, 

f(a+6, a+2n7 +0’) = f(at+o’,«+2n7+0), 
f(a+6, a+(2n+1)7+0) = (0, 0)?. 


3. : 
—x* xy zx 
——[{xy —y yz 

2XxyZ : 

zx yz —Z 


4. (1+x?+4+ y?)*. 
7. Counter example: if f: R? > R? is given by 


f(x, y) = (e*cos y, e*sin y)*, 
then f(R?) = R?— {9}. 


TYP) 


1. Jacobian x? sin y. 


2. If VY: R?>R' is given by V(u, v) = 2e"**"—ut+v, VUCO, y), g(, y)) = 0. 
Also, on all open sets, + 0; for, given u, ‘Y’(u, v) strictly increases with v. 


3. The origin in R® satisfies the equations and J) = 27u*v?w?+1 > 0. Hence 
there exists [—a, a]* x E=%, /]® in which there is a unique solution. Also 


= =e 34 
es 27u*?v?w" + 1 < 
in (—@, «)*. 
4. If {(G) has no interior point, define ‘’ : R" > R' by ¥(y) = 0 for ye f(O), 


¥(y) = 1 for y €f(). 
If f(G) has an interior point, then, whenever ¥'(f(x)) = 0 for xe G, Y(y) = 0 


in an open set containing that point. 


5. In the definition of functional dependence we may clearly replace the con- 
dition Y = 0 in any open set by (C):: ¥ = 0 in any open set containing a point 
of f(G). Then (C) + for all y € f(G), y is not an interior point of {z|‘¥(z) = 0} => 
for all ye f(G), y € support of ‘ = f(G) © support of ’. 


6. Let & be as in the definition of functional dependence of f,, ..., f,. Define 
¥* from Rt! to R' by 


ge ded bP aN Yno Yn+1) = YO1, eeeg Yn)- 


Then ¥* + 0 in any open set in R"+!, since otherwise ‘Y’ = 0 in an open subset 
of R". Clearly ‘¥'*(/(>), ..., fn(x), g(x)) = 0 for x EG. 


7. If g(x) = OC/,(X), ..., fa(x)) for x € G, define ‘Y’ from R”*? to R* by 


PO's «+02 Yue Vat = D5, «25 Vn) Pacie 


Then {y|’(y) = 0} has no interior point; for if ‘YQ, ..., Ya» Yn+i) = O, then 
Fy, 109 Pas Vas) + O When ¥,43 F Yor: 

8. f(R!) = {(0, 0)7}U {((t, +f? sin t-*)7|t > 0} and this set clearly has no interior 
point. 3 
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f, is not functionally dependent on f,, for (x*) is even (all ©), while x*sin (1/x) 
is odd. 

fy is not functionally dependent on f,, for (x4 sin (1/x)) has the same value 
for x = 1/(nm7) (n= +1, +2,...), but x? has not. 


9. Take ® = gof-. 
10. h = 2(f?+8?)?cos[f+(f 24 92)3]h, 
11. h = 4G3fg—-f"). 


7 (g) 


1. D,f() (i = 1,...,m) exists. The real function ¢, defined by g(t) = 
Ff (ty Eo5 «++5 m) has a maximum or minimum at £, and so D, f(£) = ¢,(,) = 0. 
Also D,f(€) = 0 Gi = 2, ..., m) and so Df(&) = ®. 


2. Since G is compact, f assumes its bounds (say k, K) on G. If fis constant in 
G, then Df(x) = © for all xe G. If fis not constant, k + K and f assumes at 
least one of k, K in G. At such a point f has a maximum or minimum. 


3. (2x+y, x+2y) = (0, 0) =s>x=y=0> 84+xyty +1. 
By theorem 7.71, critical points +(1, 1)”, + (4/13, 1/J30 . 
Largest value 1, least value 1/3. 


4. (2x, 2y, 2z) + (0, 0, 0) if x*+y?+z? = 2. Lagrange multiplier A such that 
Nx+ty+z)=0. A=0_ gives critical points + (,/(2/3), (2/3), /(2/3))*. 
xt+ty+z = 0 gives +(0, 1, —1)7, +(—1,0, 1)”, +d, —1, 0)?. Largest value 2, 
least value — 2. 


5, Theorem 7.71 applicable. Lagrange equations 
2x—y—z+A(2x—2) = 0, 
2y—z—x+AQyt+2) = 0, 
2z—x—y+AQz+6) = 0 
yield A(xty+z+3) =O0A=0>x=y=z2; 3x*+6x+9 > O.x+yt+z= —3 


_leads to a3 Ses Pee 
ae AS 
Substitution in condition gives A = — or 3. Hence critical points (2, —1, — 4)F, 


(0, —1, —2)?. Former gives supremum, therefore maximum; latter gives infimum, 
therefore minimum. 


6. Box has length x, width y, height z. 

(i) Consider V = xyz under constraint 2(yz+ zx+xy) = a*, xi y, z 20. 
(y+z, z+x, x+y) + (0, 0, 0) if x,y,z > 0. Critical point x = y = z= a//6. 

V = 0 when one of x, y, z = 0. Also xy < 2a?, xz < 2a and so V < a*/(4x). 
Thus V is small if x (or y, or z) is large. Hence sup V exists and is attained at 
(é, 7, O", where &, 7, ¢ > O and 2(76+¢6 +7) = a’. So €E, 9, C)f is the critical 
point € = 9 = ¢ = a/,/6. | 

(ii) V under constraint 2yz+2zx+xy = OO. 2 eS 0 5 = yea, 
€ = a/(2v3). 
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7. Rank of e 2y 7 
eee ee 


is 2 unless x = y = z,and x = y = z does not satisfy conditions. Critical points 
(1, 0, 0)”, (—4, 4, 3)”. Largest value 1, least value &. 


8. (i) Stationary points + (,/2, /2, /2)", both in E = {(x, y, z)?|x*?+y?+z? < 9}. 
(li) (2x, 2y, 2z) has rank 1 when x?+y?+z? = 9. Lagrange’s equations 
2—yz+2Ax = 0, etc. Subtract in pairs: 


(y—z)(x+2A) = 0, (z—x)(y+2A) = 0, (x—y)(z4+2A) = 0. 


Then x= y=z= +3 or y=z= —2A, x = (2A2—-1)/A, etc. By sub- 
stitution in constraint, A? = 1 or 1/12. Hence critical points are + (./3, /3, /3)*, 
+(—1, 2, 2)", + (5/3, 1/,/3, 1/,/3)7 and 8 more by cyclic permutation. 

E is compact and so f(x, y, z) = 2(x+ y+z)—xyz assumes its bounds on E. 
If a bound occurs in interior, then point is stationary point without constraint; 
if in {(x, y, z)7|x?+y?+z? = 9}, then pointis critical point of funder x?+ y?+ z? = 
9. Hence bounds of fon £ are taken at some of the 16 points of (i), (ii). Evalua- 
tion shows spuremum 10 (at (— 1, 2, 2)” etc.), infimum — 10 (at (1, —2, —2)? etc.). 
9. Theorem 7.71 applicable in (i) and (ii). In (i), critical point (4, 0, 4)7, which 
satisfies 4x? + y” < 16. In (ii), critical points (2,0, —1)",(—2,0,3)7,(—4, +./15,3)7. 

By argument of 8, bounds of x?+ y?+ z? on {(x, y, z)"|x+z = 1; 4x?+ 7? < 16} 
taken at some of the five critical points of (i), (ii). Supremum at (—4, + ./15, 3)?; 
infimum at (4, 0, 4)”. 


7(h) 
1.) af(x) = Dix) = = x; D; fe). Hence 
Es 


a? f(x) = a 3) x Dif) = 2x (3 x:D.0, (x)+ D; (9). 


(ii) Take € = (€,, ..., Sn)” with £, +0. Multiply first row of Hf() by &, and 
add to it (2nd row)+...+£,,(mth row). By exercise 7(6), 3, all elements of 
first row are now 0. Hence £,Hf(&) = 0, i.e. Hf(E) = 0. Since Hf is continuous, 
Hf(&) = 0 also when £, = 0. 


2. When (x, y) + (0, 0), 


xt+4x%y?2— y4 
iia 


(x2+ y%)2 Sexe. 
Since f(x, 0) = 0 for all x, D, f(0,0) = 0. D,f is continuous at points (x, y)? 
+ (0, 0)". |Di f(x, y)| < 3|y| and so D, fis continuous at (0, 0)? also. D. f(x, y) 


= — D, f(y, x) has same properties. Hence / is differentiable. 
All second order partial derivatives exist in R?—{6}. Also 


D,D, f(0, 0) = rie Need (0, y)—- Di FO, O)}/y = ne —Wiy = -1. 
—> yo 


Similarly D,D, f(0, 0) = 1, D? f(0, 0) = D2 (0, 0) = 0. 


3. Both sides equal A(h, k)/(hk); e.g. if g(t) = f(E + th, n+k)—fE, 7+h), 
there exists « in (0, 1) such that A(h,k) = g(1)—g(0) = g(a). Again, by mean 
value theorem, 


(Di f+ eh, 7+k)— Di fE+eh, nik = D,D,f(E+ah, 9+ Bk) 
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and r.h.s. > D, D, f(é, 7) as (A, k)? > (0, 0)". Thus, given ¢, there exists 6 such that 


|{D. f(E+h, n+ Pk)— Def E, n+ Pk)}/h— D,D,f(&, 1)| < € 
if 0 < |h|, |k| < 6. Take 0 < |A| < O and let k > 0. Then 


(Da fE+h, 1)—DafE, DHh— D2 Dif 1)| < ¢. 


4. Let g be a real function differentiable at 0 only (e.g. g(x) = x” when x is 
rational, 0 when x is irrational). Define f by f(x, y) = g(x)+g(). Then D, f(, y) 
= D, f(x, 0) = g’(0) for all x, y; and D,D,f(O, 0) = D,D,f(, 0) = 0. All three 
sets of conditions involve the existence of D, f, D, f in an open set containing 
(0, 0)7, but D, f(x, y) exists only when x = 0, Dz f(x, y) only when y = 0. 


5, The continuity of the kth order partial derivatives implies the differentiability 
of the partial derivatives of order k—1. Hence fand its partial derivatives up to 
order p—1 are differentiable. Then, by induction and the chain rule, for 
k =1,...,p, g(t) (|t| < r/||A||) exists and equals D™f(£)(h)*. By C1, 81, there 
exists « € (0, 1) such that 


g(1) = g(0)+2'(0)+...+¢?-%O)/(p— 1)! + 2(@)/p!. 


6. By 5, for ||Al| < 6, say, f(E+h)—f(E) = 4D°f(E+ ah)(h)*. 
@if H,f >0(k =1,...,m), by continuity, H,f(E+h) > 0 when 
|Al| < 8’(<6). Hence f(E+h)—f(E) > 0 for 0 < |hl| < 0. 
(ii) Similar. 
(iii) Take any he R™. When 0 < t|/Al| < 4, 
A(t) = f(E+ th)—f(E) = 4D°fE+ ath)(th)?. 


As t +0, A(t)/t? > $D?f(E)(A)?. 

If fhas a minimum at , A(t) > 0 for all sufficiently small ¢ and so A D2f(E)(h)? 
=lim A(t)/t? > 0. Since this holds for all h, D?f(£)(.)? is positive semi definite. 
If f has a maximum at &, then D?f(£)(.)? is negative semi-definite. But, if (iii) 
holds, D?/()(.)? is indefinite. 


7. f has stationary points (0, 0, 0)", (—1, -1, —1)7, (-1, 1, 1)”, , —1, D7, 
(1,1, -1)%. At (0,0,0)", H,f=2, H,f=4, Hf = 8. Elsewhere Myf = 2, 
H.f = 0, H.f = —24. Minimum at (0, 0, 0)7; no other maxima or minima. 

Infimum of fin x?+y?+z? < a? taken at (0,0, 0)? or on x?+y?+z2? = a’. 
funder constraint x?+ y?+ 2? = a® equivalent to a*+ 2xyz under same constraint. 
Critical points (+ a//3, + Ja/./3, + a/V3)", (+4, 0, 0)7, (0, +a, 0)", (0,0, +a)". 
f( +a, 0, 0) = a’, f(+a//3, ta/J3, ta/V3) = a®+2a*/(3V3). When a? < %/, 
infimum 0; when a? > 22, infimum a?— 2a°/(3./3). 


8. Stationary points (0, 0)”, (4, 4)7, (—4, 4)7. Maximum at (0, 0)7; neither maxi- 
- mum nor minimum at (4, 4)", (—4, 4)”. If f attained its infimum on K at an 
interior point, this would have to be a minimum. 


8 (a) 
1.0d)¢~¢.Q¢~yv=¢= wou, where & satisfies (i), (ii), (ili). Since & 
strictly increases, a! exists, so that y¥ = goa. Clearly a satisfies (i), (ii), (iii). 
3) d~ywandy ~ v= ¢= poaandy = yof = ¢ = xo(foa). If a, # satisfy 
(i), Gi), (iii), so does Pow. 
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2. Define a: [0, 3] > [a, b] by 


Sahib (O<t 
att) = 
4(3a+6)+4(6-—a)lt A <t 


< 1), 
< 3) 
and put % = doa. Then & ~ ¢, but vy is not differentiable at 1. 


3. Let 9’, w’: [i—1, i] > R” be (equivalent) representations of y; (i = 1, ..., m) 
and define ¢, Y: [0, m] > R” by 


At) = ot), WtH= Yt) G-1 <t <i). 


There exist «:[i—1,i]>[i-—1,i] such that ¢* = foa‘, ai(i-1) = i-1, 
a‘(i) = iand (ii), (iii) are satisfied. Since «‘(i) = «*+1(), we can define « : [0, m] > 
[0, m] by a(t) = a(t) G—1 < ¢ <i). For i-1 < ¢ <i, i-1 < a(t) = a(t) < i 
and so 9(t) = $(t) = ¥'(e(t)) = W(a@(t)) = W(a(t)). Since this holds for all i, 
@ = wou. Also & satisfies (ii), (iii). 


4. Let ¢', ¢? be the restrictions of ¢ to [a, uJ, [u, b]. Define «!:[0, 1] > [a, u], 
a*:[1, 2] > [u, b] by 


a(t)=at+(u-a)t (0 <¢t <1), a(t) = u+(b—uy(t-—1) (1 <t <2). 
Define ¥" on [0,1], ¥? on [1,2], % on [0,2] by y* = dion (i = 1,2), 
Y(t) = '() for i-1 < t< i @ = 1,2). Then y' ~ ¢}, y& ~ ¢? and clearly 
y ~ . Since y represents y,;+y, and ¢ represents y, y;+7Y. = Y. 


5. If ¢ on [a, 5] is any representation of y and J is the dissection of [a, b] with 
no intermediate points, ||g—p|| = A(Y, ¢) < A*(¢). 


6. Trace {(x,0)|—1 < x < 1}. A(y,) = 2. A(y.) = 6 ¢3 not of bounded 
variation. 


2q7 
7. Ay) ae 2(2+1] 


2 
8. A(y) = [ J2+12)dt = 6+ log(/2+ 4/3). 


a cos! t 


as (2+1) ‘ P tdt = &(p+q) 
i= = a cos — tat = +q)a. 
2q q 0 2q f 


9. Let ax u<v<«<b. By theorem 8.12, s(v)—s(u) = A%(d) and, by (8.11), 
0 < Ax) < V2%(9)+...+ V2(¢,). R.h.s. > 0 as v—u > 0, by theorem 6.44. 


10. Adapt the proof of exercise 6(z), 3, using 9. 
11. Since |f| < 1, the series defining ¢, y converge uniformly. 

ia; =O, then © <7 < tand/(7) = 0; ife, = 1, then} <7 < 1 and f(z) = 1. 
In either case, f(7) = &. 


3°7 = even number + 0:(2e,)(2/,)...(in ternary scale). 


Hence f(3’7) = &. In general f(3*"-*7) = «, and so g(r) = §. Similarly (7) = 4. 
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8 (5) 


1. Since ¢ ~ wv, ¢ = oa, where « has usual properties. There is a dissection 
a= a) <4a,<... <a,=b such that ¢,@ are continuously differentiable in 
each [a;_,, a,] and vv is so in each [c;_;, cj], where c; = %(a;). By theorem 6.92, 


[’ (Zov)v; = a {(foW ViJoa}a’ 


j—1 


= [" (hovoalijoaa’ = [" Goss 


j—1 


Second part similar, for (|y)|oa)«’ = |¢;| since |a’| = &’. 
2. In an interval [a;_,, a;] of continuous differentiability of ¢, 
[2 n6—m— #(- onde = [per otod 
3. ¢ on [a, b] represents y, Z is a dissection of [a, 5]. Then 
IZ AA(7)} (At) -— PDH < TIAGCd3| |GG)-$G-D)| 
< Ul PPC} {sg(t) — Sg(tsi-D} < MAC). 


As «(Q) > 0, first term > f f\, third term > { || f || ds. 
Y Y 


yx ty =00< r= 2; [ oe. 
1 


yuy =,ye 10 = fe Dy; [ = (1/,/2) arc tan (1/,/2). 
Ye 


Weixe f y= 10S 1S 1); i = 1—(1/,/2)arc tan /2. 
y3 


{ = po. = 1/(2V2)-1. 


5. Trace = {(x, y, z)|x2?+y? = 2ax}n {(x, y, zl +y2+z2* = 4ax}. i = Sgn, 
Y 


6. [ = ;:7°abc, i: = —7;mabc. 
2 (on 


7. 6 on [a,b] represents y; @ = a) < a, < ... < a, = b are such that @ is 
continuously differentiable in each [a;_,, a,]. Then 


[1- [ome = 3 |" (3 risen) ar 


aj—1 \t= 


T aj d a 
i: | < sO}dt = D leo) 6 
j=1 Jaj-1 rs 


= g(v)—g(u). 
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8. By hypothesis, g is uniquely defined. If xe D, B(x;5) < D for some 6. 
For |t| < 6, x+te'e D, where e* has ith component 1, others 0. If |A| < 4, 
let y,(A) be the segment from x to x+he*. Then 


h 
e(x+he') =| f= e(x)+ | F(x + te')dt, 
yart yilh) 0 
i.e. ghey 8) a 5 [Act teat 


Since fis continuous, r.h.s. > f,(x) as h > 0. 


9. / = csinh (a/c). sech?x < 1 (x > 0) = tanh (a/c) < (a/c) = (d/dc)c sinh (a/c) 
< 0 for c > 0. Hence c sinh (a/c) decreases as c increases. Since c sinh (a/c) > 
coo as c>0+ and >a as c>o, the equation in c, c sinh (a/c) = / (> a) has 
a unique solution. 


10. 3514/25. 


8 (c) 


1. To simplify notation, take a; = 0, b; = 1 (i = 1,..., 7). Let D, be the dis- 
section of [0, 1] x... x[0, 1] into r” congruent subintervals. If M = sup |/|, 


S(G,)—1B,) = ; a= tr(2. 52) -1(4=, eras | 


dis ever tn=l ‘2 r r F 


1 | > (2 =) x (3 =)| = 2nr’ 1M 
= Sermet Se ae ere Ty ie — 
r"\sometiz=n \P oe someiz=0 \" r - 


_ 20M 
oe 


>0 as r>o. 


2. There is a dissection Y of I into subintervals 1, ..., J, such that 
Tr 
SD’, N-\Bf) = = (M,—m) VI) < «. 


Let D be the dissection of K into the subintervals K, = I,xJ. Since V(K)) 
= VI)VV), SG, $)-S(G, 0) < eV). 
Similarly ~: R”xJ— R', defined by w(x, y) = g(y), is integrable over K. 


Hence A = ¢y is integrable over K. Also | h(x, y)dx = 20) | f(x)dx. Use 
I I 
theorem 8.31. 


5. Inner integral is elliptic, not expressible by elementary functions. But integrand 
continuous, so interchange of order of integration legitimate. Integral 
= 2#(tan ~—sec a—cos a+2). 


6. Define ¢:[0, 1] > R' by d(x) = 0 (x irrational), d(x) = 1/¢ (« = p/q, p,q 
1 

coprime). By exercise 6(/), 9, 1 ¢ exists and is 0. Let Z’ be a dissection of [0, 1] 
| 0 


into subintervals [a,;_,, a;] such that S(Y’, 4) < e. Let f*:[0, 1]x[0, 1] > R 
be given by /*(x, y) = ¢(x); let D be the dissection of J into subintervals 
[a;-1, a;] x [0, 1]. Then sD, f) = 0 and SF, f) < SQ, f*) = SD’, d) < « 


CHAPTER 8 499 
1 
By exercise 6(h), 9, [ I(x, y)dx = 0 for every y. 
0 


1 

By illustration (ii), p. 141, I f(x, y)dy does not exist for any x. 
0 

h = f+g, where g is defined like f, but with x, y interchanged. 


7 1 
= { f(x, y)dx = 0 for every y; { f(x, y)dy exists for x = 47 only. 
0 0 
Let Z be a dissection of J = [0, 47] x [0, 1] into subintervals J;, ..., J,. Then 
r 1 vA 
GD, f)-s\D,f) = oe care 
$9, 1)-99,) > 3 (-9) VOD = Z5 
Hence fis not integrable over J, and so not over I. 


8. Any [x,x+6]x[y, y+6] < I contains a point of P;, where p, > 1/6. If 
a = mlp (p prime), then the lines x = a, y = a each contain p—1 points of P; 
if a not of this form, the lines contain no points of P. 


1 
For every x, f(x, y) = 0 for all except a finite number of y. Thus { f(x, y)dy = 0, 
0 
1 1 FT 
{ dx | f(x, y)dy = 0. Since P is dense in 1 | f= i f = 0. 
0 0 I PE 


1 1 1 
9 If0<y< 1 | I, y)dx = 1. Hence | ay | f(x, y)dx = 1(y = 0, 1 need 
0 0 0 


1 1 
not be considered). dx [ tS, y)dy = -1. 
0 


8 (d) 
1. Let I, > E; let J,, I, be non-overlapping with common face. Then fe: = 
in J,—(/, N 1,), so integral of fz over I, is 0. Use (8.33). 


2. Call interval J; I, = [a,+¢, 6,-€]x...x[an+é, b,—€], (€ > 0, small). Then 
I, © 1 I,cU,) < cD < cd) < ec). 


3. Let m = inf f, M = supf, J any closed interval containing E. Then 
0 = me(E) = | mus < | fe < | fa < | Mxe = Mc(E) = 0. 
z ai I I 


4. (i) f = 1 is not integrable over a set without content. 
Let A < E < I, where A has content, J is a closed interval. If Q is a dissection 
of J into subintervals J;, 


S(D, fa)— QD, fa) = t+ U4 Zs) (Mj — mj) Us), 


where =, ranges over the J; ¢ A°, XZ, over the J; ¢ I—A, 2%, over the J; with 
points of frd. Then ¥, < SQG,fz)—-SQ,fz), U2=0, Us < (SG, xX4)- 
s(Z, x4)Msup fz —inf fz). X41, Zs > 0 as WD) - 0. 

(ii) fe_x =fa—fx. 


5. (i) Replace function yz of exercise 6(h), 5 by fz. 

(ii) fintegrable over E, 0 E,, by first part, and so over E,— E, = E,—(Ei Nn E,), 
by 4 (ii). | 

(iii) Put f = f+—f-, where f+ = 4(\f|+f) = 0,f- = 4F|-S) 2 0. 
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6. Let m = inf f,; M = sup/ in £, J a closed interval containing E. As in 3, 
wAk) = | fn < | fig < MOK) 


and so integrals > 0. Also, since fz = fz_x,+/fx;; 


[fe < [tant] few [ f > | fen 


+{ ine 

ot 

and so { te < lim int | foxy < lim sup | fem < | tre 
I I I JI 


7. (i) If m = inf f, M = sup f, Ia closed interval containing E, 


| ms < | fs <| My 
I I I 


(ii) Since E' is compact and connected, so is f(E) ¢ R'. Hence f(E) is the 
finite closed interval [m, M]. 


8. For x, ye J, put |x—y|| = 6, min(x,, y,) = u;, max(x;, y = v; @ = 1, ..., 7). 
Then 0 < v;—u; < 6. If M = sup |f| in J, by 5, 


Lo LA 


bes CT gnJai 


< | il 
vw—-du 
< Mc(J,—J,) 


= O(6) as 6-0. 


9. Let D be closed disc with centre (a, b), radius r. Then xp is continuous except 
on path given by x = a+rcost, y= b+rsint (0 < ¢ < 27). Hence c(D) 
exists and, if J = [a—r, a+r]x[b—r, b+r], 


at+r b+r at+r 
e(D) = i ee [ as [ ake dir [ Ie = aide = ae 
I a-r b-r es 


Since c(fr D) = 0, c(D°) = c(D). 


10. (i) There is a dissection of Jinto subintervals 7, such that 2(M,—m,)c(I)) < €. 
If J, = I, x [m,, M,+€] (note m, may equal M,), E < U J; and 


UcV,) = &(M,—m,+6)c) < e+ec(Z). 
(ii) Let 
Do = {(%, eeeg Xavi Ch, eee9 Xn) E : ie 0 < Xn+1 3 I(x, eery Kats 


D, = {iy ..-s Xnevd iO, 5 MEE, 0 Se Se 


Dy, is open, D, is closed. Since xp is constant in D, and in D{, yp is continuous 
on D,U Dj. : | 
Let J be a closed interval containing E, S the set in (i) corresponding to 


fe. If gq > supf, (Dyn D,) < Ux (0!) U SU (fr Ex [0, g]) 


CHAPTER 8 501 
and sets on right have zero content. (fr E has content 0 in R”.) Hence yp is in- 
tegrable over J = J [0, qg] and 


qd 
a = [ eae { ieee a | oe 
ae I 0 


fHE( 21. «+65 En) 
= | dry..dis | VdXn41 =| fe = sa 
Z 0 : : I E 


Ag ae 
11. In both parts the subsets of R? have content. (i) 3p? (ii) 3373)" 


‘ 


13. Let E be triangle with vertices (0, 0), (1, 0), (1, 1). Since integrand is bounded 
on E and continuous except when x = 0, 


1 1 1 x e—1 
{ ay | e~V* dx | e*dxdy = { ds | e~vtdy = —. 
0 y E 0 0 2e 


14. E has content and integrand f continuous on E. If 
A, = {0 2ly = 0, z = 0, y+z < 1-45}, 


1 1 1-z 1-x-y 47 
{ ‘| dx faydz = | ax | ay | fdz=-=--. 
E 0 Ax 0 0 0 5 4 


15. ¢ on J = [a, b] x [a, b] given by A(t, u) = f(t) is integrable over J (exercise 
8(c), 2). Hence (x—t)""1f(t) is integrable over J and so over triangle with vertices 
(a, a), (b, a), (6, 6). Then 


1)! | Sie { ae { "(u— ty f (dt = { a { ati fia. 


8 (e) 
1. o differentiable at x: ash—>0O+, 
fet W—-f@) _ oh x+ A) f@)-fO— _ oUe—h, x) 
h ~ (x, x +A)’ h c([x — fA, x]) 
f differentiable at x: |f(x+20)—f(x)—tf'(x)| < et if \t| < 6. Hence, when 
0<h,k < d,h+k > 0,7 
a([x—h,x+k]) ,, 
efx —h, xk) 7 
_ f+ -fO)-KF'OI+ F@O-SE-M-M'OD © , 
= h+k a 


2. (XN X) = c(X) = 0, so o(X) = (XU X) = o0(X)+e(X). 


+> Do(x). 


8 (f) 


1. (i) If there exists x € J(u; a) such that x ¢ E, then the segment from u to x 
(which lies in J(u; a)) contains a frontier point of E. Impossible. 

(ii) If there exists x € (R"—I(u; b)) Nn E, consider the half line A from u through 
x. Since E is bounded, there exists y¢ A—J(u; b) such that y ¢ E. Hence the 
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segment from x to y (which lies outside Z(u; b)) contains a frontier point of E. 
Impossible. 

(Clearly the cubes could be replaced by convex sets S,, 5S, with S, © S, and 
En S; + 2.) 


2. If X is the set of points of discontinuity, proceed as in (ii) of proof of theorem 
8.61 (8.62), but let S, cover QU X (QU XU frG). 


3. Let AB be side in question, C opposite vertex. A rigid body motion takes 
A, B,C to (a, 0), (6, 0), (p,q), where g > 0. Let a< p <b. Since segment 
from (p, 0) to (p, g) has content 0, 


P q(x—a) i 
content = ——— dx+ ——_— dx = b--a). 
[ ee pe x = $q(b-a) 
Cases p < a, p > b similar. 

5. 47ab(a?/p* + b?/q?). 


6. Clearly % is continuously differentiable and injective in R?. E = ~-(D) 
= {(u, v)|0 < u<v < 1}, J¢ = 1. Integral = 47-4. 


7. JO(x, y) = 8xy, so Jd(u, v) = $(uv)-*. Content 4(b—a)(q—p). 
Qn 2a(1+cos @) 

8. cS) = | do i rdr 
0 0 


10. If D is the disc with centre (— 2a, — 2a) and radius ./40a, then 


aa ee 2 2 
c(Q) =| (=e) _2 a2 dxdy = 1007a?°. 
D Z 8a 


11. () If, in R*, D = {((x, y)|x?+y* < 40%, 


67a’; centroid (3a, 0). 


content = 2 c V(a?— x*?— y*) dxdy = 1(8 —3./3)7a?. 
(i) If, in R?, E = {((r,0|0 <r <asind,0<0< 7m, 
content = if /(a?—r*)rdrdb = %a® [ "a — |cos*6|) dé 
= $q° i” (1 —cos*0)d6 = 2(37—4)a3, 


12. (i) Given 4%, ..., Xn, x{+...+x2 = r? determines r; then x, determines 
On-1, «++, Xz determines 0,, x, and x, determine 0. 

Let J, be determinant with r = 1, so that J = r*-1J,. Multiply first row of 
J, by cos @,_,/sin @,_, and subtract from last row. This becomes o ..,%, 
—1/sin @,;, and sin @,_, factor of all elements in first n—1 columns. 
J, = —sin"-*6,,_,Jn_1. Since J, = 1, result follows. 


7 
(ii) Use theorem 8.62 and note that, if J, = i sin*@ dé, 
0 


Tox-tlen = 7/k,  Toedorss = 27/(2k + 1). 
13. (19-24 log 2)/36. 
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Qn > 
14. J D, f, = [ ar | D,f,(r cos t, r sin t)rdr 

B 0 0 
2m i or) 0 

= { ar Fe [f(r cos t, r sin t)r cos t]—-— [f(r cos tr sin ¢)sin a] dr 
0 0 (er ot 
2a aoe Qn oO 

= { f,(cos t, sin t)cos tdt— { dr { a [f(r cos t, r sin t)sin ¢]dt 
0 0 0 


| 2n 
= { fx(cos t, sin t)cos tdf, 
0 
by theorem 6.84. Similarly 


2n 
| { D.f; = { f,(cos t, sin t)sin tdt. 
B J0 


8 (g) 
x e . 
1. For every X > a, $x = [ f(x, .)dx is continuous on the compact set d; 


a 
Adapt the proof of theorem 5.35. 


2. Since ¢ = i flu, .)du, ¥ = f(., y) are continuous, 
pares, = 2 |" ( f° feu edd) do = |" 70a rd 
y Jb a a 


D,D.F(x, y) = { f(u, y)du = f(x, y). 


3. Let (7) exist, where 7 = (1, «-- In)- Take any y = ()1,.-., Yn) in J. Then 
es y) — f(x, 1) 


Yn Ye 
= { Dinu Sf (X; t, V2 e089 Ynydt+ Diniz (x, UE t, ae Pnpet 
1 Ne 


Un 
ae { D. fies 68 
Yn 


and r.h.s. is continuous on J (theorem 8.71). Hence f(., y) is integrable over J. 
Formula for and continuity of D,¢ follow from theorems 8.71, 8.72. 


4. Define f on [a, b]x[c, d] by f(x,y) = 1& rational), 0 (x irrational). Then 
D.f (=0) is continuous, but f(., y) is not integrable for any y. 


6. When showing ¢’(a) = 0 distinguish between a = 0, a + 0. Since 4(0) = 0, 
g(a) = 0 for |a| < 1. For |a| > 1, 


g(a) = P(1/a)+ [ log a?dx = 27 log |al. 
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z 
7. If w on [c, d] x [c, dis given by W(y, z) = [ tS (x, y) dx, then 
a. 
Zz 
DQ, Zz) = [ D. f(x, y)dx, Dey, z) = f(z, y); 
a 


and 0'(y) = (d/dy) Ly, g(y)} 
= Difty, ev}+2’0) Davy, 20}. 


8. Since B is a region, independence of path of : f is equivalent to existence of 
Y 


g such that f = grad g (8(b), 7, 8). | 

(N) If f= gradg, then D,f; = D;D,g, D;f, = D; Dig; D;D;g = D; Dig, 
since both are continuous. 

(S) Let c = (cy, ..., Cn) be centre of B. For any x = (x, ..., Xn) € B, let y; 
be the path from c to x consisting of the segments determined by the points 


(c,, eoeg Cady (c,, eeey Cn—1> Xn)s cers (c,, X95 seve Xads (x1, ees so | 


Define g: B > R! by 
Ly "rn 
g(x) = [ f = [ Fil, Xq5 0005 Xn) Att. + i Ileus cz Gy st. 
Vx Cy Cn 
Then D,g(x) = A(x, ..., Xn) and, by theorem 8.72, for i > 2, 


Ly Li—1 
Dia(x) = [ D: ft, X25 seiy OE Di filer, s++9 Ci_oy t, Xiy very Xn) at 
Cy Ci—1 


+fi(c1, coey C45 Xis eees Xa) — Sil%1, errs z). 


since Df, = D1, .... Difiaa = Di-afy 
9. log(b/a). 


11. (@) For y>0, 4() = [ ey 
0 


formly for y > 0 > 0 (see example of §6.9). Hence ¢(y) = 4my+A(y > 0). 
But ¢ is continuous on R! and so 0 = ¢(0) = lim (47y+ A) = A. Since ¢ is 
y—0+ 


dx = 47, since integral converges uni- 


even, $(y) = 47|y| (all y). 
(ii) Call integral Y(y). Then w’’(y) = 37 (y > 0) and Wy) = 37y? (y > 0), 
— amy? (y < 0). 


12. |sinh xy| < cosh xy < e*lvl < e!” for x > 2\y|. This gives existence of 
oe) 

?(y) and uniform convergence of | xe-*sinh xy dx (integral for ¢’(y)) on every 
0 


finite interval. 
Differential equation implies e~*”"¢(y) = A. Since 6(0) = 4,7, d(y) = 4 ret’, 


13. Integrals for ¢(y), ¢’(y) converge uniformly for all y, integral for ¢’(y) 
for y2dO>0. _ Differential equation shows P(y) = Ae”+ Be-¥+4z, 
'(y) = AeY—Be~ for y>O. These also hold for y = 0, since ¢, ¢’ 
continuous on R. ¢(0)=¢'(0)=0 give A=0, B= —4i7. Thus 
Py) = 4n(1—e~”) for y > 0, diy) = —4n(1-e”) for y < 0. 
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‘ = t ; : 
14. For any a, if b > 0, D.d(a, b) = i) trees dx, since integral converges 
dx 


uniformly for b> /> 0. Similarly D, D.¢(a, b) = I, (14 a®x’)(1 1 58x) if 


b > 0; integral = 47/(a+)5) if also a > 0. 

For a> 0,b> 0, D.¢d(a, b) = 47 log(a+b)+ A, where A independent of a. 
But D,¢(a, b) >0 as a>O and so D,¢(a, b) = 47{log(a+ b)— log b}. Hence 
d(a, b) = 4n{(a+b) log (a+ b)—a—b log b+ B}, where B independent of 5. 
As 6 > 0, d(a, b) > 0 (and 5 log b > 0). Hence B = a—a loga. 


15. Use theorem 4.73 as in proof of theorem 6.33. Integral = (7/2). 


9 (a) 
—** eS 
ae cos x 32 52 ae 
oe 1 +(~— 1) con be 
5. OF a 2p n sin nx, 
Gi 4 ~, se sin(2n+1)x if 5 is even. 


fy—1 (2n+1P-—b* 


10. In example 4, ¢, is a lower approximative sum to the integral 


™ sin tx 
[ Se 
0 t 


. (7%sin u 
For the last part, the analogous integral is = du. The integrals over 
0 


(0, 7], [7, 27], [27, 37], ... have alternate sign and decreasing magnitude. 


9 (db) 
a x sin(1/x) (0. <a < 1). 


9 (c) 
2. By theorem 9.54, given ¢, there is (— 4, 6) in which osc. Fourier series of /, 
is <é. Osc. Fourier series of f, in (—4, 6) is 2KA, where A is the Gibbs ratio 
- - sin t dt 


eT a 4 


9 (d) 
1. No. |z| > R and lim sup|a,|¥”" = 1/R contradict a,z” = o(n). Theorem 9.62. 
xz" summable (C,1) for |z| < 1, z + 1. 
Uniformly summable in {z| |z| < 1, |z—1| > 0}. 
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nr 
2. (i) s,(x) = a,x" increases both with ” and with x. 
1 


From theorem 4.73, since lim lim exists, so does lim lim. 


r—>1— n> no 2—>1- 
(ii) Write nlja,| = c, Givene, take m to make c, <«¢ for n> m. Use 
1—x* < k(1—~x). 


m (oe) 
Ea- Za 
0 0 


m 
< (1-—x) dient 
0 


Take x = 1—(1/m) and let m > o. 
3. We can take c, real and k = 1. Given e¢, choose N such that 


d(1—6€) <c, <d,i+e6 for n>N. 
oe) N ©O N co 
Then C(x) = Denwx"® = Yt Dd < Dlenlx"++9 Dd,x". 
0 0 N+1 20 0 


fee) 
Divide by D(x) = )d,x" and let x > 1-. 
0 
lim sup{C(x)/D(x)} < 1+6. 
Similarly, lim inf > 1—e. 


x 
4. Integrate I (1 -*) x* sin x dx by parts. 
0 


Sum (A) = lim { e-% f(x) dx. 
60+ J0 


9 (e) 
1. (f+)? and (f/—g)”. 


2. Use Parseval on §9.2, example 1, and exercise 9(a), 7. 


3. Use 1 and exercise 9(a), 2. 


9(f) 
ore) —N N 
1. mla(u,)—a(uy)| < a + 4 lf| + { f(t)(cos u,t—cos upt)dt}. 
— CO —N 
Given €, there is N making first two J's on right side less than ¢. 


N 
Last term < Nla—ual | [f| < ¢ if |u,—u,| small enough. 
—N 


2. (iii) : = 


“(A+ x? 


| = 
3. The transformed integral { e-* cos xu du is elementary, with value b/(x? + 5”). 
0 
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10 (a) 


1. (i) Collinear z, <> J real 4,, A,, A; with UA, = 2A,z, = 0. 
From the last, 2A,zZ, = 0. Hence condition 


(ii) 7 4 


ro ee 
2. See Note (2). Constant term in f(z) is n! ap. 
oes, <1, 7 +1, 

— | —z)Za,z"| = |ag—(@o—@)2—... — a,2"*| 

> Ay— |(Agp—Ay)Z+... + Anz"*1| > ag— {(Agn—ay) +... tan} = 0, 

since the z” (r = 1, ..., 2) have not all the same phase. 
4. Iifx> 0, phz = arc tan(y/x). 

if x < 0, phz = arctan(y/x)ia for y>O or <0. 

2 ee phz= +497 for »> 0 or. <0. 
5. None; all except +7; only 0; real z. 


6. u?+v2? = k > uu,t+vv, = uu,tvv, =O>u=v=0 or 4&v,—U,v, = 0. 
C-R equations. 


7. These rules give a ‘dictionary-order’, but do not satisfy the axioms of an 
ordered field (see an Algebra text, e.g. Birkhoff & Maclane, Survey). 


If for example, a = —1—i, b = 1—i, c = 2i; then a < b but atc > bte. 
8. Solve with the conjugate 27+ bz+é = 0. 

If |a| + |b], one z. If |a| = |b], none or mfinitely many according as ac + 
or = cb. 


9. Use |A|? = AA. Equality in Cauchy-Schwarz if a,/b, same for all r. 


10. Assume >. Repeated bisection (C1, 59) of [z,, z,] gives contradiction. 


10 (6) 
1. If im a > 0, then |z—al < |z—a| <=imz> 0. 


2. (i) Circles on diameters (— 3, —4) and (0, $). 


3. From §10.5 (8), w—-1 = k=. Values for z = 0,1 givek = —1,a = —4. 
4. By continuity at frontier, w is real if z is real, Use $10.5 (2) with 0, 1, «. 
For last part, put z = 7. : 
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—a 
Be ; 
se Zn-1 =p 
If |k| < 1, z, >a; if |k| > 1, ern If |k| = 1, no limit. 
Ifa = f, z, >a. 


= ae 3 
.— is (5) . |w| < 1 gives Ph = in | 5. >|. 


w+l z+1 
= : 9 S777 
z- locus is three lunes, vertices at + Land-angies in rangés | — = |,) —.— |. 


& 2 6° 6 
ie 
2 6 


w—1 z—4\? 
. —— = | — | . See 6. 
wt+l S 5) 
8. Fixed points are e7*”, e~”'". Mapping is 
isis e7tln a a, 


Rae on Se 27i/n __ 


Regions rez, > 4 and |z,—1] > 1. 


1+ 2, Z3—1 


9. 2 ee = le HE Ss = 
Z3t+l 


1—z,’ 


10. Given R, there exists nj such that |z,| > R for all n > mp. 


10 (c) 


1. (i) and (ii). Every value of either side is a value of the other. Principal values 
are equal in (i), not always in (ii). 


2. exp(—2m-+4)z7. 


3. (i) |z| = k go into confocal ellipses; phz = « into confocal hyperbolas. 
For (ii), combine (i) with z = exp ¢. 
4. iz exp(—iz). 
5. Strip 4(22m—1)7 < x < 4Qm+1)z. 
6. IfO0 <r< 1], 
r sin @ 
a O+r2)+ia Meme 
log (1 +z) log(1+2r cos 6+r?)+/arc tan (eS) 

Equate real parts in &(—1)"-1z”/n = log(1 +2). 

Let r>1-—. First series = 4 log(4 cos?40) for —7 < @ < 7. Second series: 
sum = re log(1 +e’cos 0) = 4 log(1+3 cos?0). 

pags A 


7. Let z, = x-+iy, (r = 1,...,n) be zeros of p. x, 20. —~= » : 
p(z) 1 2—Zr 


Suppose p’(z) = 0 for z = x+iy, where x < 0. 


Real part of right side <0. Contradiction. 
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8. Absolute convergence if x > 1. Uniform convergence for x > 1+. 


9. (i) If z = Zo, |zo| < 1, take 6 with |z,| < 1—¢0.< 1. uc. in |z| < 1—¢ by 
M-test. 

(ii) if re w, < —6 < 0, then |[z"n”-1| < n°, 

(iii) Dirichlet’s test. 


10. Let >) A,() = (a,cos n+ b,sin nO) be the Fourier series of f(@). Then the 
0 is 


Poisson integral (theorem 9.73) is 
YA,(Ar” = re X(a,,— ib,)z” = re P(z), say, 
0 


where ¢ is regular for |z| < 1. 
1)"r"cos nO 


2 HD 008 nl 
: ee 


n?—4 


11 (a) 
1. 7iR?, —7R?, 0, 0. 


2. 3(1+i). From primitives sin z, e*. 
3. 27i, —8i/3, 0, —4. 
4, All. (l—z)7}. 

5. Yes, yes. 


6. Z. Z with segment [—2/, 2/] deleted. A region containing no circuit which 
encircles a point (+n+4)z7, e.g. the strip —47 < rez < 47. 


1h = re | (u—iv)(u,+iv,) (dx+idy) = | awe vv,) dX + | (—uv,+vu,) dy. 
Y 
By C-R equations, latter | = | (uu,+vv,)dy. So R= 4 + v*], i 


8. Theorem 11.11. Since |e**| = e-” < l ony, J is O(1/R). 
Integration by parts into det 
BG - | a iz® 
gives each term O(1/R?), a smaller bound. 


9. f(z) = {k+€(z)}/(z—a), where |e(z)| < € if d < 6) and a < ph(z—a) < #. 


Lie Lie {k+e(z)}idO > ik(B—a) as 6-0. 
Y a | 


(oe) ; ee) 
un. | ss =| 2 
—~o l+iy Se 
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12. Call the integrals I,J and the inequality to be proved |S| < 77. Use 


[F(e)|2 = fe) fle; 
e Ge?? do = wr (integer p 0) =O (tp = 0); 


7 
| e'?9 dd = 0(p + 0), 27(p = 0). 
-—7 


\ nT 
Then I= > (- Dana | Gem) 19 49 = —2ni§S, 
7 


m,n=1 - 
J = 2n7T; 
[Z| < aJ. 
Hence |S| < 7T. 


13. Let z = z(t) for a < t < b bea representation of y, 


b 
i [ sfc} Se)}z"(dt 


b dw 
= gw) dt = l|.his. 


11 (5) 


1. Quadrisect a triangle by joining mid-points of sides. Simple polygon can be 
split into triangles by diagonals inside it—proof by induction on the number of 
sides. 


2. Given ¢, take R, a square, centre ¢, side 26 so small that |f(z)| < ¢/é for z on 


OR». 


By theorem 11.11, I S| < 8e. 
aR, 


3. Continuity of f’ is assumed. 


eo, 


. Along one side a(1—i), a(1+i) of the square 
dz a idy @ y+tia : 
—- = = d = e 
i: poe : 35 — 


11 (c) 
. 27ia*t (theorem 11.81), 27ie*/3! (theorem 11.91). 


arc tan z (+7). 
2rif{n(y, 0)—4n(y, —1)—4n(y, 1}. Possible values are + 27i, + 77, 0. 
Theorem 11.91. 

mode Bk 1 { [a yee 

= (=) ~ Oni y (> *) Zz = 


since, for any given x, the = is u.c. on y. Let y be z = e* (0 < 8 < 27). 


aw No 
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5. Theorem 11.81. The < from §10.5 (8). 


6. Integrand is fe + Fe 34| f(z). R?/@ is outside C. 


—a R*-za 
7 ttyis C= ref) = se | Pac = S| wr nea 
0 
Also [, ze (u—iv)e- db = soeieese of [ Geb e9 dO = 0. 
0 
tS 
= @/o= =| (e-7)# 
fiz) <1 oe = 0: If jz) > 1,f() = —2ni/z. 
Gi fe) =2 5 { ee and, ony, € = 16. 


9. Integrate along semicircle ¢ = Xe? (0 < 6 < 7) and its diameter. By theorem 
11.81 and continuity of integrand for im¢ > 0. 


1 i 1 
f-0 = 5 | (7-74) 104 
10. If a rectangle R in D, U D, is divided by / into two rectangles R,, R2, then, 


f(z+h)-f@) 


1. 7, =f, ont ee conjugate of AGT : 


Since f;(Z) exists, f,(z) exists. 


12 (a) 


1. Branch point is z = —1 (if k is not an integer). 
Principal branch f, defined to be 1 for z = 0, is regular and has Taylor’s 
series in |z| < 1. From p. 354 and theorem 12.11, 


fz) = k(k—-1)...0k-14+1) +2)" 


k 1 
and (1+z)* = 1+kz+.. eee DM at 
2. 422?-yez4+452°...; R = 4a. 


3. Method of (say) C1, ex. 6(d), 6, namely: 
If w = (arc sin z)?, then (1 —z”)w” —zw’ = 
From Leibniz’s theorem, 


(1 — 22) w+ — (2n+ 1)zwt) — n?w™ = 0. 
Putting z = 0, we have coefficients. R= 1. 
4, z+4z9+7425+...; R = 4. 
5. 1—42?-»¢2z4...; R = 47. 
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6. 4—424+4523...; R = 27. 


7. ¥(—1)"2"*1/n! (2n+ 1)! (2n+ 1), all z. 
0 


8. Xd,z", where d, = number of divisors of n, including 1 andn. R= 1. 
9. cot z = —tan(z—47) say. 
im. 2; 15; 
11. f, regular at O and =0 at 3, 4,2, ..., =0. (i), (ii), (v) No. 
12. |a,| < MIr" gives 
[f@)— aol < ¥lag| [zl < ME(z\/r" = Mlz|/(r—le). 
No zero if this <|ap|. 


13. If f = 0, 9¢ in D with f(D + 0. This + holds in a disc B(C;r). fg =0= 
g = 0in BC; r) and, by theorem 12.12, in D. 


14. Use |A|? = AA. By uniform convergence of the series, 


Lf ipre|2d0 = |” Sande retn-9. 0 = Sandigr? 
= | lf(re | ==] man? e a ~thiniiiads ? 


all terms with m + n vanishing. 


15. By 14, 4, = 0-1 2 +k. 


k 
16. If p,(z) = > 6,2", the given | is 
0 2 Z 
2 |an— Oy P+ 23 |@n\*. 
0 k+1 


17. Let y be the circle z = Re‘? (0 < 6 < 27). By theorem 11.81, 


[ 2. [Ae fe de fae). 
- a—b 


a—b\z—a z-—b 


J 


18. (i) In theorem 12.12, |a,| = o(r1-”). True. 
(ii) fis a polynomial of degree at most k. 


Mr7rR 


< ja-b((R—la))(R—16) 


If |f(z)| < M, theorem 11.11=> 


19. (ii) If g(z) = f(z)— 8, then 1/g is regular and |1/g| < 1. 
If, a, b, c being suitable constants, au+bv—c > 0 [or < 0], then fis constant. 


20. Take f?. 
21. 1/|f(z)—wo| < 1/6. 


22. f/g is constant. 
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12 (b) 
1. For 1 < |z| < 2, binomial expansions give en (5) —-»> (;) F 
2 0 2 y 
= For jz] < 1, 1—z+z°—z*+z®—.... 
ii = 1, ee 


4. For |z| > 0, taking f round z = e”, 
| a 2) — ido 
J(u) = ey, = exp,(iu sin @) ond * 
To obtain the series for J,, multiply the series for exp 4uz and exp (—u/2z). 


5. Either branch is one-valued in |z| < 1 (or in |z| > 2), hence expansible in 
Taylor (or Laurent) series. Cannot expand in 1 <|z| < 2. 


7. From $10.1, the Jacobian u,vy—UyVz, = |f’|*. 


Ts Qn 
Area = : rdr | Lf'(re®)|2d0. See 12 (a), 14. 
0 


" 


12 (c) 
0. f(z) = z4—3z?+7z?—9z+6 = 0. We first look for real roots. Sajy= 0 
and only if x = 1. f(1) = 2, minimum. No real roots. 

To find the number of zeros of fin the first quadrant, let z describe the simple 
circuit (positively oriented) composed of (1) real axis 0 to R (R large), (2) 
quadrant z = Re‘? (0 < 9 < 4n), (3) imaginary axis from iR to 0. Call the corres- 
ponding paths of w(=/(z)),T,, 2, P3. We need to find the index of w = 0 
for ',+T,+I;3. I, is part of the real axis and at f(R) we can take Phw = 0. 
On I,, w = Rtet#{1+O(1/R)}. At f(iR), Phw is 27{1+O(1/R)}. To trace the 
change in Phw on T's, we have w = u+iv = yt—Ty?+6+i(3y?—9y). 

Mark the crossings with the imaginary axis uv = 0. First is at y = /6, second 
when y = 1. At former v > 0, at latter v < 0; u changes sign from + to — at 
former, from — to + at latter. When w returns to f(0), the value of Phw is 47. 
The index of 0 for r', +I, +1 is 2. Two roots in first quadrant. By conjugates, 
two in fourth. 


1. By theorem 12.5 integrals are —47i, (—1)"27i/(m—1)!, 77. 


2. Take a + b.c = a—b. Residue at a is coefficient of 1/¢ in Laurent expansion 
about = 0 of ¢-"(£+c)-. This is k,/c2"", kn being the relevant binomial 
coefficient. Residue at b is k,/(—c)?""1. 3 


3. Follow method detailed in 0. Rouché (theorem 12.62) with f = 5 shows no 
zeros in |z| < 4. 


4. 5. 8. As 3. 


6. On |z| = 146, |e? | < |z”| if d small. Rouché shows n roots in |z| < 1+0. 
There is one root z = e” with 0 = 00n |z| = 1. 
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7. w = sinh 77z gives 
u = sinh7xcos7y, v = cosh7x sin 7y. 


The four lines y = +4, x = 0, x = X bound a rectangle. 

As z describes its positively oriented boundary, w describes I composed of 
right half of the ellipse : 

Uu . Se 1 

sinh?X cosh2X 
from —icosh7X to +icosh7X and the line segment iv from v = cosh 7X 
to —cosh 7X. If rew, > 0 and X large, index of wy for I’ is 1. Hence w = wy 
exactly once in the strip. 


9. fie) = 2m (F—8)" 


1—az 


has m zeros at 0 and n at a. |f| = 1 on |z| = 1 from $10.5 (8). Take in Rouché 
g= —-a. 


10. Method of 7. 


11. Put € for 1/z and p,(€) = an 
0 


Given R, €, there is N with |p,(¢)—exp ¢| < € on |¢] = Rforn => N. 
Take € < inf|exp (Re*’)| for 0 < 0 < 27. By Rouché, exp¢ and p,() have 
same number of zeros in |¢| < R. exp ¢ has no zeros. All zeros of p, have |¢| > R. 


12 @) 


- None (removable at 1, —1). 7 

. 3. Simple poles at (—1)*. At 00, 2 has pole, 3 essential singularity. 
. Simple poles at 4(2x+ 1)7. Limit point is 00, singularity. 

. Essential singularity at 1. 


. Simple poles at 2u7i (n + 0). Remember oo. 


oo nN 2 N 


cos z = a+ib gives cos x cosh y = a, sinx sinh y = —5b, 
Continue either (i) as in 12(c), 7 or Gi) directly (ong) cosh y = A, + Ab, 


where A, ='4/{((at+ 1)? +53, A, = 4/{(a—1)?+ 67}. 
Then cos x = A,—A, and cosh y = A,+A,. Solve these. 


13 (a) 
1. (i)-(iii) |z| < 1 and |z| > 1. 
(iv) (vii) None. 
(viii) Strip |y| < log 2. 
2. (i) |z| < 1. ee (iii) x > 0. 
(iv) x < 2. (v)x > 0. (vi) Z, punctured at 0. 


(vii), (viii) None. 


CHAPTER 13 515 
3. On = (8t+f)+(e—-A)Xns 
where 2x,(z) = (2"—1)/(z" +1). 


4. M-test and theorem 13.11. Simple poles at +ni and +i/n (1 + 1), double 
poles at +i. 
Singularities at 0 and 00, being limit points of poles. 


5. Analogue of corollary 1 of theorem 13.11. 
6. By theorem 13.11, any function continuous, but not regular, in D is an ex- 


ample, e.g. |z| in any D. 


13 (5) 
1. Find, if possible, z for which all terms in the Taylor series have the same phase. 
(i) z = —ir gives e’, (ii) sinhr, (ii) (sinh Ar)/A/r. 


2. 1/fin theorem 13.22. 
3. Theorem 12.62. 


5, Let w = exp (2zi/n), where n(f—a) > 27. Apply theorem 13.22 (corollary) to 
f(z) f(wz)...f(w"z) and then use exercise 12(a), 13. 


6. Write M = M(r), M, = M(r), M2 = M(r,); p, g integers. 
r?M? < max (r?M3, r?M9). 
r? Mé < r? M$ if and only if | 


(1) Ls log (M,/Msz) 
q__ log(re/ry) 
With such p, q we have r?M? < r3Mj, i.e. 
(2) log M < loss +log Mz. 


The infimum of the p/g satisfying (1) is 
log (M;/M2) 
log (r2/r1) : 
(2) holds if p/g is replaced by this infimum. 


7, 2 f{@) = Lidn2", then i,(r) = py Pr Le ae 


(i) I(r )2 

(ii) For ae 13.22, suppose f(z) = Xa,(z—Zo)” is regular in B(Zo, R) and 
|f| has a maximum at Zp. 

Then, if r is small enough, 


lao|? = |f(Zo)|? = U|an|?r?”. 
Hence a, = 0 forn 2 1. 
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(iii) Write u = log rand J(u) = I,(e") = 2 \a,|?e?™ 


d? I,J” —J” 
ee ~~. log Jo = — 


Numerator is 
(D|a,|? e?") (Za, |24n2e?) — (X|an|22ne2")2 > 0 (Schwarz). 


13 (c) 
1. The last three. tan $77z has poles when z is an odd integer. 


2. | f(r exp (27ip/2"))| +00 asr>1-. 


3. By theorems 12.13, 13.13, 


= fe 1({ am \t 
[ ere’ ai = 5 lop (principal value). 


m(x+ ai) 


Eeeas 
om I e-v dt converges. 
0 


Write r? = x?+y*. re(r.h.s.) =;{ 


Analogue for integrals of Abel’s limit theorem gives 


waaay i fe 
ar? 3 ay 


{ cos yf?dt = lim = | 
0 r—>0 2 


ee - 


4. If false, there is p for — rs (z—p)’ ea for z= 1+dé> 1. The 


series is 


y ee) ¥ n(n—1)...da—v+1)a,p"~’. 


v=0 v! 
Terms are positive. pyp> is > beni pP)+p}", which therefore converges for 
z = 1+0. This contiadices z= oe “3 


5. No generality is lost by operating only with discs having rational centres and 
radii. z* with irrational a, log z. 3 


6. First part from theorem 13.11. 


pe ee oe © (1 —2!-*) &(z) = \(—1)""1*n-~. 
1 
1 
Asz7>l, Oa a > log oes 5 af. 


7. If 6 is small enough, the rectangle with corners —éd+i6, 1+6+/i0 is in D. 
Let 0 < b—a. 

Let C, be the disc B(4nd; 6) for n = 0, 1,.... 

Let a < 4ké < b. f and all its derivatives are real at x; = 4k0, y = 0. From 
Taylor’s series in C;,, f and all its derivatives are real at x,_, and so, successively, 
at AKABy +009 XG 
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ow a= 4, b = 3, f(z) = value of (Z- 1)? which is positive 
fort} <x<i,y=0. 

D = a simply- connect region containing z = 0 and segment [3, 2] of y= = {) 
and excluding z = 


14 (a) 
1. Integrate round boundary of sector |z| < R,0 < phz< af 


2. (7/n) cosec (7/n). Without evaluation, 


1 ‘oer, 1 
ee i= [BS < [ae 
0 0 n 


o 1+ x” 1+x" +1’ 
and 0< { oa < { = = oa 
+ itx’ oo wet 
3. Validity for a = b can be proved from theorem 8.73. 
Write 


Had = | carainerTDy 
Suppose a is fixed and (say) 3a <b < 3q, Integrand <1/(x?+ a)? (x?+ 4a’) 
and 2 of this exists. Hence the integral ¢(a, b) converges uniformly for 
4a<b pe} 3a and so is continuous for b = a. 
Alternatively (and more simply) we may note that (for fixed a), f(a, b) de- 


creases as b increases. Hence ¢(a,a—) > ¢(a, a) = $(a,at+). As ba, the 
first and last of these have the same limit 377/8a°. 


4. If k < 0, value is —47e*cos ka. If k = 0, f = 0. 


5. §11.2, example 3. Differentiate the factor 1/z. 


se- [el fae 20) fea 


6. | as in example 2. To prove r oes a =0|5 A) 
x 


0 
3 XY -|= oY] oe “i sin ax 
dx. 
ax” ; ee se 
8. Vertices ab, aB, Ab, AB. 
9. Ifyisz =e" (<6 < 2n), 


| 4 [ sigh Wfiae 
«4, 2z—kK)(kz—-1) 
Poles inside y at 0, k. 


If |k| > 1, put k = 5. |-z [a Examine k = +1. 


10. Method of 9. 277/n!, 0 
12. Rectangle of example 3, indented at 0, 277. 
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14. Example 4 (method (i)). 


[ © (log x)’— (log x+ 27) 


= dx = 27i {res (47i)+res (37i)} = 48-747. 
0 


16. Sector |z} < R, 0 < phz < iz. 
The integral for 1 io states that it is its own Fourier transform if the definition 
of §9.9 is widened to admit unbounded integrands. (1/,/x in x > 0.) 


cO 
17. By theorem 8.75, ¢’(x) = [ exp (— 4u?)(—u sin ux)du. Integrate by parts. 
0 


18. For the latter, integrate sin zu/(e**—1), where a = ,/(27), round the rect- 
angle 0, X, X+ 27i/a, 27i/a. Let X > ©. 
19. From theorem 9.95, if i |g(t)| dt exists and g satisfies conditions for the 


convergence of its Fourier series at t, then 


4{(e(t+)+e(t—)} = = ee du | g(T) cos u(t—7) 77. 
Since sin u(t—7) is an odd function of u, the right-hand side is 
1 a = . 
- E sie Se du { ee 2(T) e*—-” dr, 
Put g(t) = e-*f(t) if t > 0 and g(t) = O if t < 0. Then, when ¢ > 0, 


= U—-o a 


HAC )+SO-Y) = tim [ eovemtdu [” pene erirrar, 


The inner integral is F(a+ iu). 


14 (5) 


1. 2. 3. Method of §§14.2, 14.3, or derive from results obtained there; e.g. 
z—4 for z in §14.2 gives 
(@) 1 772 


A {z—(n+4}2_ cos?77z" 


Integrate, as in §14.3, to deduce 2. 


1 
4. Poles of s 


$(1 or w orw?). Take, as in § 14.2, 


7 cot 7¢ 
oe 8¢3 — 1 ds 


i are at z = 4(1 orw or w”) where w= e?”*® ; residues respectively 


and we have 
= 1 


=. 
> = Bai = cot 47+ w? cot 47w?),. Simplify. 
r=1 = 


8. (i) 4z coth 4z (which > 1 as z +0) has Taylor’s series in |z| < R, where R 
is distance of nearest singularity, namely + 277. 
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(iii) Equate coefficients of z®” in 


cosh 4z = (2/z) sinh 4z=B, 2"/n! 


i.e. in x a a> wail |2 oa?" 
2 5! 227(2n)! 4 22(2r+ IIo rn! J 
14 (c) 
; fo) co (—1)""? 
10. onright = >) ») @na, Where Qnn = 2. 
n=l1lm=!1 n 
Now » |@mn| = —log(i—|z|") < id <2|z|" if m > mo. 
n=1 i= a 
Hence ©) D> |amn| converges if |z| < 1. 
m=l1in=1 
By theorem 4.74, 
23 .S ann = >> a ann = = 2 log(1+z”). 
n=1m=l1 m=l1n=1 m=1 


Take exponential of each side. 
| a | iH 
11. cos 7z = 1 -5 : 
nail UE 


12. (i) False. a, = 1/nlogn. 


(-1)*" 1 = 
\n 7 = Poe 


(ii) False. a, = 


n 
(iii) True. [] tends to a finite limit, zero or non-zero according as Xa? diverges 
1 

or converges. 


13. (i) If p; is the Ath prime, 


m 1 —1 m co 
Il (1 --) = II (S2:") a a) n-#, 
k=1 P, k=1 \r=0 
where =“ means summation over all n not divisible by pms, .... 
As m-> 00, every integer m appears in &™ and r.h.s >€(z). 


(ii) If x > 1, Bp-* (a sub-series of Xn~*) converges absolutely. Hence product 
converges, and so €(z) + 0. 


(iii) In (i) write z = 1. As m> 00, 2™ n> Therefore II (1 — p=) diverges 
to 0. Then (theorem 14.41) X(1/p) diverges. 
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14. Let g be a function with simple zeros at the a, Then, for suitable "Fn 


Xe (2) 2 EXP {9n(Z— an} 


8 (an) (Z— rn) 
converges. g, = real mudtion: of dp. 


15. (Gj) Theorem 14.51. 
(ii) As for sx/sx in theorem 14.52. 
(iii) Theorem 13.11. 
(iv) (a) (—z) is got from go(z) by rearranging terms of an absolutely 
convergent series. 
(b) Similar. 
(c) A'(zZ+ 20) = —22(z—OQ,, n+ 2;)-*. The sets of points Qin,n— 2, and 
Qmn are the same. The series for ’(z+2w,) is a rearrangement of that 


for go’(z). 
(d) Integrating (c), (z+ 2,) = (z)+ A. z = —0, gives A = 0. 
1 1 
se ge OF 
(v) (z)—z fee Go, i | (even function) 
2 4 
= 2’ cere if |z| small enough 


A, Z* + a3Z4+ O({|z]). 


Thus {(z)}* = J O(\z|), 
zZ 2 


io4s) = <- <a 16a3+ O(|z}). 


Hence {90’(z)}? — 4{,0(z)}? + 20a, (0(z) + 28a, = O(|z]). 


Expression on left is an elliptic function, regular for all z and O for z = 0, 
hence = 0 (Liouville). 
The coefficients g,, g, are 60L’Q-4 . 140 8 | Rieger 


m,n? 


14 (d) 
1 to 5 follow from § 14.6 (5), e.g. 
fe 1 ptt Dato) (nto%) _ 1 
— p27i/3 rae Se eS 
2. Ifw=e I (1+) = 73 = Terres 
_ sin(—77) = cosh 47/3 
Ss 7 1 : 


5. Product is 


Z (n—4—42z)(n+4z) 
. (1 2 ) ( +) = nee (n—4)n : 


The rearranged product is 


(4n—z—3)(4n—z—1)(2n+z) | Ta) Td) 


Ni 


which simplifies to 22T(4—42)TU +42) 
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6. Pole at z = —n(n = 1,2, ...). 
; 3 (n+ z)T(n+z) (—1)"-1 
= i T = Se 

a we cae ee CEayy 


14 (e) 
—imjre) 1 ee Lal eee 
i. (i) T(a+b) G+khyek: (ii) T(a+b) eager. 6 
RODE Caer 
Zz (1) 4J(Qn)’ (ii) 4c2,/(2n)’ where c = Td). 


3. Integrate e~*z*-! round boundary of 6 < |z| < R, 0 < phz < 4z. 
. Theorem 14.72. 


nan bb 


. Integrate (z+ z-')*z’-! round the boundary of 6 < zi <1, -47 < phz < iy. 


co 
6. Integrand is #7“! }/e-™'. Justify inversion of { and . 
1 
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enumerable 11 

equating coefficients in power series 122 
essential singularity 390° 
Estermann 379 

Euler, on trigonometric series 335 
exponential function 352 
extended plane 347, 396 
extension of function 9 

extension theorem of Tietze 130 


factor theorem of Weierstrass, sine pro- 
duct 432 

filter 100 

fixed points 52, 350 

Flanders 378 

Fourier integral 330, transform 334, 423 

Fourier series 299-330 

Fréchet 31 

frontier point 30 

Frullani’s integral 194 

function element 408 
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